Transcriptional regulator

ABSTRACT

Genes each encoding a novel transcriptional regulator having a bromodomain have been successfully isolated from a human testis cDNA library using primers prepared based on an EST sequence found using the bromodomain sequence of the transcriptional regulator. These genes are structurally analogous to each other.

[0001] This is a continuation-in-part of PCT/JP98/01783, filed Apr. 17,1999, and claims priority from Japanese Application Nos. 9/116570, filedApr. 18, 1997, and 9/310027, filed Oct. 24, 1997.

TECHNICAL FIELD

[0002] The present invention relates to a novel transcriptionalregulator containing a bromodomain and a gene encoding it.

BACKGROUND ART

[0003] The bromodomain is a characteristic motif of proteins found intranscriptional regulators. Proteins having a bromodomain usuallycontain one or two (Tamkun, J. W. et al., (1992), Nuc. Acids Res.,20:2603), but sometimes as many as five bromodomain motifs (Nicolas, R.H. and Goodwin, G. H. (1996), Gene, 175 (12):233-240). This motif isfound in a wide variety of animals. For example, it is identified inyeast (Winston, F. et al., (1987), Genetics, 115:649-656; Laurent, B. C.et al., (1991), Proc. Natl. Acad. Sci. USA, 88:2687-2691), in Drosophila(Digan, M. E. et al., (1986), Dev. Biol., 114:161-169; Tamkun, J. W. etal., (1992), Cell, 68:561-572), and in the genes for transcriptionalregulators in mammals (Denis, G. V. and Green, M. R. (1996), Genes andDevel., 10:261-271; Yang, X. J. et al., (1996), Nature, 382:319-324).

[0004] All transcriptional regulators containing a bromodomain serve tocontrol signal-dependent transcription in actively proliferating cells(Tamkun, J. W. et al., (1992), Cell, 68:561-572; Haynes, S. R. et al.,(1992), Nuc. Acids Res., 20:2603). Due to this feature of thesetranscriptional regulators, it is suggested that cancer may develop ifthe gene for the protein containing a bromodomain is not normallycontrolled. In fact, several studies have shown that humantranscriptional regulators with a bromodomain RING3, p300/CBP, and PCAFmay be involved in oncogenesis.

[0005] RING3 is a transcriptional regulator highly homologous with thefsh protein that regulates development of Drosophila (Haynes, S. R. etal., (1989), Dev. Biol., 134:246-257). RING3 is a nuclearserine/threonine kinase having autophosphorylating activity. Thisactivity of RING3 correlates with a proliferating state in chronic oracute lymphocytic leukemia. For instance, when Denis and Green collectedlymphocytes of peripheral blood from 10 patients suffering fromleukemia, kinase activity associated with RING3 was identified in all ofthe 10 patients but not in normal controls (Denis, G. V. and Green, M.R. (1996), Genes and Develop., 10:261-271). Furthermore, this activitywas not detected in the blood cells from patients whose leukemia hadremitted by virtue of chemotherapy.

[0006] p300 and CBP (CREB binding protein) encode highly similarproteins and are thus often called p300/CBP. p300/CBP is a co-activatorfor a transcriptional regulator CREB (cAMP responsive element bindingprotein) (Kwok, RPS et al., (1994), Nature, 370:223-226), and isconsidered as a key protein for growth regulation. Mutation in p300/CBPhas been found in familial or sporadic cancers. Germline mutation of CBPresults in Rubinstein-Taybi syndrome, which causes patients to developvarious malignant tumors (Petrij, F. et al., (1995), Nature,376:348-51), while mutation in p300 is found in sporadic colorectal andgastric cancers (Muraoka, M. et al., (1996), Oncogene, 12:1565-1569).Furthermore, CBP is fused with MOZ (Monocytic leukemia Zinc fingerprotein) in a t (8; 16) (p11; p13) translocation found in a certainkinds of acute myelocytic leukemia. The fusion protein hashistone-acetyltransferase domains derived from both genes (Bannister, A.J. and Kouzarides, T. (1996), Nature, 384:641-643; Orgyzco, V. V. etal., (1996), Cell, 87:953-959; Brownwell, J. E. and Allis, C. D. (1996),Curr. Opin. Genet. Devel., 6:176-184). Since acetylated histone is knownto be associated with transcriptionally active chromatin, the fusionprotein may be involved in leukemogenesis by way of aberrant histoneacetylation (Brownwell, J. E. and Allis, C. D. (1996), Curr. Opin.Genet. Devel., 6:176-184).

[0007] p300/CBP is also considered to be associated with cancer since itinteracts with known oncogene products. For example, p300/CBP binds toElA protein (Arany, Z. et al., (1995), Nature, 374:81-84), one of theearly genes of adenovirus. p300 is also a co-activator for transcriptionfactors, c-Myb (Dai, P. et al., (1996), Genes Dev., 10:528-540) andc-Fos (Bannister, A. J. and Kouzarides, T. (1996), Nature, 384:641-643).

[0008] PCAF, is considered to inhibit the interaction of E1A withp300/CBP by competing with E1A for binding to p300/CBP (Yang, X. J. etal., (1996), Nature, 382:319-324). PCAF also hashistone-acetyltransferase activity.

[0009] Thus, it is thought that transcriptional regulators containing abromodomain are involved in regulation ofcell growth, and that theiraberrant regulation may be closely related to various diseases,particularly to cancer. Transcriptional regulators containing abromodomain have thus recently received much attention as novel targetsfor specifically treating cancer.

DISCLOSURE OF THE INVENTION

[0010] The objective of the present invention is to provide a noveltranscriptional regulator containing a bromodomain and a gene encodingit, and a method of screening for a candidate compound as a medicamentby using them.

[0011] As a result of research to achieve the above objective, theinventors successfully isolated several genes, each of which encodes anovel transcriptional regulator containing a bromodomain. The genes wereisolated from a human testis cDNA library using primers designed basedon EST sequences which had been identified using known bromodomainsequences as probes. In addition, the inventors have found that thestructures of the isolated genes resemble one another, thus theyconstitute a family. The inventors have also found that the isolatedgenes or proteins encoded by them can be used to screen the candidatecompounds for a medicament that controls the activity of the proteins orother factors interacting therewith.

[0012] Thus, the present invention relates to novel transcriptionalregulators each having a bromodomain and the genes encoding them, and toa method of screening for a candidate compound as a medicament usingsaid proteins or genes, and more specifically relates to:

[0013] (1) a transcriptional regulator having a bromodomain, whichcomprises the amino acid sequence shown in SEQ ID NO:1, 13, 21, 27, or29, or said sequence wherein one or more amino acids are substituted,deleted, or added;

[0014] (2) a transcriptional regulator having a bromodomain, which isencoded by DNA hybridizing with DNA comprising the nucleotide sequenceshown in SEQ ID NO:2, 14, 22, 28 or 30;

[0015] (3) DNA coding for the transcriptional regulator according to (1)or (2);

[0016] (4) a vector comprising the DNA according to (3);

[0017] (5) a transformant expressibly retaining the DNA according to(3);

[0018] (6) a method for producing the transcriptional regulatoraccording to (1) or (2), which comprises culturing the transformantaccording to (5);

[0019] (7) an antibody binding to the transcriptional regulatoraccording to (1) or (2);

[0020] (8) a method of screening a compound having binding activity tothe transcriptional regulator according to (1) or (2), wherein themethod comprises contacting a sample with the transcriptional regulatoraccording to (1) or (2) and selecting a compound having binding activityto the transcriptional regulator according to (1) or (2);

[0021] (9) a compound having binding activity to the transcriptionalregulator according to (1) or (2), which can be isolated according tothe method of (8);

[0022] (10) the compound according to (9), which is naturally occurring;and

[0023] (11) DNA specifically hybridizing with DNA comprising thenucleotide sequence shown in SEQ ID NO:2, 14, 22, 28, or 30 and havingat least 15 nucleotides.

[0024] Here, the term “transcriptional regulator(s)” means protein(s)that control gene expression, and “bromodomain” means an amino acidmotif conserved among the transcriptional regulators associated withsignal-dependent transcription, wherein said motif is involved inprotein-protein interaction.

[0025] The present invention relates to novel transcriptional regulatorshaving a bromodomain (BAZ family). The nucleotide sequences of cDNAisolated by the inventors, which belong to BAZ family, are shown in SEQID NO:2 (BAZ(BAZ1α)), SEQ ID NO:14 (BAZ2α), SEQ ID NO:22 (BAZ2β), andSEQ ID NO:28 and 30 (BAZ1β). The amino acid sequences of proteinsencoded by the cDNA are also shown in SEQ ID NO:1 (BAZ(BAZ1α)), SEQ IDNO:13 (BAZ2α), SEQ ID NO:21 (BAZ2β), and SEQ ID NO:27 and 29 (BAZ1β).

[0026] The bromodomain is characteristic of a structural region that isconserved among a group of transcriptional regulators involved insignal-dependent transcription (Tamkun, J. W. et al., (1992), Cell,68:561-572; Haynes, S. R. et al., (1992), Nuc. Acids Res., 20:2603), andit has been reported that the six mammalian genes, i.e., RING3,p300/CBP, PCAF, BRG1, HRX/ALL-1, and TIF1, which encode transcriptionalregulators having a bromodomain, are associated with cancer. That thetranscriptional regulators having a bromodomain are commonly associatedwith cancer suggests that the genes isolated by the inventors are alsoassociated with cancer. Other than a bromodomain motif, the proteinsencoded by the genes isolated by the inventors share the characteristicmotifs of (1) C4HC3 zinc-finger, which is found in the proteinsexpressed in a wide range of organisms from yeast to human and isbelieved to be involved in a protein-protein interaction or nonspecificbinding to DNA; (2) leucine zipper, which is present in manytranscriptional regulators and is known to contribute to form a dimerwith the protein itself or other proteins (Busch, S. J. andSassone-Corsi, P. (1990), Trends in Genetics, 6:36-40); (3)LXXLL motif,a motif commonly found among many transcriptional co-activators, whichis shown to be required for mediation of transcription induced by anuclear receptor (Torchia, J. et al., (1997), Nature, 387:677-684;Heery, D. M. et al., (1997), Nature, 387:733-736); and (4) nucleartransport signal, which confers the transporting activity into thenucleus on the proteins synthesized in the cytoplasm.

[0027] The combination of a bromodomain and C4HC3 zinc finger is knownto be associated with several breakpoint genes of leukemia (Tkachuk, D.C. et al., (1992), Cell, 71:691-700; Gu, Y. et al., (1992), Cell,71:701-798; Miki, T. et al., (1991), Proc. Nat. Acad. Sci.,88:5167-5171; Le Douarin B. et al., (1995), EMBO J., 14:2020-2033;Borrow, J. et al., (1996), Nature Genet., 14:33-41). Accordingly, thegenes isolated by the inventors are important candidates for breakpointgenes of cancers.

[0028] The transcriptional regulators of the present invention can beprepared as recombinant proteins generated using a recombinant genetechnique, or as naturally occurring proteins, according to a methodknown to one skilled in the art. The recombinant proteins can beprepared using a method such as incorporating DNA encoding atranscriptional regulator of the present invention (e.g., DNA having thenucleotide sequence shown in SEQ ID NO:2, 14, 22, 28, or 30) into asuitable expression vector, which is then introduced into host cells,and purifying the protein obtained from the transformant. The naturallyoccurring proteins can be prepared using a method such as preparing acolumn which utilizes an antibody obtained from a small animal immunizedwith the recombinant protein prepared as above, and subjecting theextract from a tissue or cells in which a transcriptional regulator ofthe present invention is overexpressed (e.g., testis and cancer cells)to affinity chromatography using said column.

[0029] The present invention also relates to transcriptional regulatorsfunctionally equivalent to the transcriptional regulators of the presentinvention having the amino acid sequence shown in SEQ ID NO:1, 13, 21,27, or 29. A method of introducing mutation into amino acids of aprotein to isolate a protein functionally equivalent to a particularprotein is well known to one skilled in the art. Thus, it is well withinthe art of an ordinarily skilled person to isolate a transcriptionalregulator functionally equivalent to the transcriptional regulators ofthe present invention having the amino acid sequence shown in SEQ IDNO:1, 13, 21, 27, or 29 by appropriately modifying, for example,substituting amino acids without affecting the function of thetranscriptional regulator. Mutation in an amino acid of a protein canalso occur spontaneously. The transcriptional regulators of the presentinvention include those having a bromodomain and the amino acid sequenceof SEQ ID NO:1, 13, 21, 27, or 29 wherein one or more amino acids aresubstituted, deleted, or added. Examples of known methods forintroducing amino acid mutation into the protein are a site-directedmutagenesis system using PCR (GIBCQ-BRL, Gaithersburg, Md.) and asite-directed mutagenesis using oligonucleotides (Kramer, W. and Fritz,H. J. (1987), Methods in Enzymol., 154:350-367). The number ofmutagenized amino acids is usually 50 amino acids or less, preferably 30amino acids or less, more preferably 10 amino acids or less, and mostpreferably three amino acids or less.

[0030] As another method of isolating a functionally equivalent proteinutilizing a hybridization technique (Sambrook, J. et al.,, MolecularCloning 2nd ed. 9.47-9.58, Cold Spring Harbor Lab. press, 1989) is wellknown to one skilled in the art. Based on the DNA sequence encoding thetranscriptional regulator of the present invention shown in SEQ ID NO:2,14, 22, 28, or 30, or the fragment thereof, a person with ordinary skillin the art can isolate DNA highly homologous to said DNA sequences usinga hybridization technique (Sambrook, J. et al., Molecular Cloning 2nded. 9.47-9.58, Cold Spring Harbor Lab. press, 1989) to obtain atranscriptional regulator functionally equivalent to the transcriptionalregulators. The transcriptional regulators of the present inventioninclude those encoded by DNA that hybridizes with DNA comprising the DNAsequence shown in SEQ ID NO:2, 14, 22, 28, or 30, and which contains abromodomain. The hybridization and washing conditions for isolating DNAencoding a functionally equivalent protein are defined as lowstringency: 42° C., 2×SSC, 0.1% SDS; moderate stringency: 50° C., 2×SSC,0.1% SDS; and high stringency: 65° C., 2×SSC, 0.1% SDS. Thetranscriptional regulators obtained by the hybridization technique mayhave amino acid homology of preferably 40% or more, more preferably 60%or more, still more preferably 80% or more, or most preferably 95% ormore, with the transcriptional regulators having the amino acid sequenceshown in SEQ ID NO:1, 13, 21, 27, or 29. In particular, high homology inthe bromodomain sequence is considered significant for the functionassociated with cancer. Functionally equivalent transcriptionalregulators to be isolated may contain, other than a bromodomain, asequence involved in interaction with another protein (e.g.,leucine-zipper or LXXLL motif), a sequence involved in binding to DNA(e.g. zinc finger), or a nuclear transport signal. The presence of thebromodomain in the protein can be identified by searching thebromodomain motif PROSITE database on DNASIS (HITACHI Softwareengineering).

[0031] The present invention also relates to DNA that codes for atranscriptional regulator of the present invention. The DNA of thepresent invention includes cDNA, genomic DNA, and chemically synthesizedDNA, but is not limited thereto as long as it codes for atranscriptional regulator of the present invention. cDNA can beprepared, for example, by designing a primer based on the nucleic acidsequence shown in SEQ ID NO:2, 14, 22, 28, or 30 and performing plaquePCR (see Affara, N. A. et al., (1994), Genomics, 22:205-210). Thegenomic DNA can be prepared according to a standard technique using, forexample, Qiagen genomic DNA kits (Qiagen, Hilden, Germany). The DNAsequence thus obtained can be determined according to a standardtechnique using a commercially available dye terminator sequencing kit(Applied Biosystems) and the like. In addition to applying to theproduction of recombinant proteins as described below, the DNA of thepresent invention may be applied to gene therapy and the like.

[0032] The present invention also relates to a vector into which the DNAof the present invention is inserted. There is no particular limitationsto the vector into which the DNA of the present invention is inserted,and various types of vectors, e.g. for expressing the transcriptionalregulators of the present invention in vivo and for preparingrecombinant proteins, may be used for each purpose. Vectors used forexpressing the transcriptional regulators of the present invention invivo (in particular, for gene therapy) include the adenovirus vectorpAdexLcw and the retrovirus vector pZIPneo. A LacSwitch II expressionsystem (Stratagene; La Jolla, Calif.) is advantageous when mammaliancells, such as CHO, COS, and NIH3T3 cells, are used. An expressionvector is particularly useful for producing a transcriptional regulatorof the present invention. Although there is no particular limitation tothe expression vectors, the following vectors are preferred: pREP4(Qiagen, Hilden, Germany) when E. coli is used; SP-Q01 (Stratagene, LaJolla, Calif.) when yeast is used; and BAC-to-BAC baculovirus expressionsystem (GIBCO-BRL, Gaithersburg, Md.) when insect cells are used. TheDNA of the present invention can be inserted into vectors using astandard method.

[0033] The present invention also relates to a transformant expressiblyretaining the DNA of the present invention. The transformants of thepresent invention include one harboring the above-described vector intowhich the DNA of the present invention is inserted and one having theDNA of the present invention integrated into its genome. The DNA of thepresent invention can be retained in the transformant in any form aslong as the transformant expressibly retains the DNA of the presentinvention. There is no limitation to host cells into which a vector ofthe present invention is introduced. If the cells are used to express atranscriptional regulator of the present invention in vivo, desiredcells may be used as target cells. Cells such as E. coli, yeast cells,animal cells, and insect cells can be used for producing thetranscriptional regulators of the present invention. The vector can beintroduced into the cells by methods such as electroporation and heatshock. Recombinant proteins can be isolated and purified from thetransformants generated for producing the said proteins according to astandard method.

[0034] The present invention also relates to antibodies that bind to thetranscriptional regulators of the present invention. The antibodies ofthe present invention include, but are not limited to, polyclonal andmonoclonal antibodies. Also included are antisera obtained by immunizingan animal such as a rabbit with a transcriptional regulator of thepresent invention, any class of polyclonal or monoclonal antibodies,humanized antibodies generated by gene recombination, and humanantibodies. The antibodies of the present invention can be preparedaccording to the following method. For polyclonal antibodies, antiseracan be obtained by immunizing a small animal, such as a rabbit, with atranscriptional regulator of the present invention, then recovering thefractions that only recognize the transcriptional regulator of thepresent invention through an affinity column coupled with thetranscriptional regulator of the present invention. Immunoglobulin G orM can be prepared by purifying the fractions through a Protein A or Gcolumn. For monoclonal antibodies, a small animal, such as a mouse, isimmunized with a transcriptional regulator of the invention, the spleenis removed from the mouse and homogenized into cells, the cells arefused with myeloma cells from a mouse using a reagent such aspolyethylene glycol, and clones that produce antibodies against thetranscriptional regulator of the invention are selected from theresulting fused cells (hybridoma). The hybridoma obtained is thentransplanted into the abdominal cavity of a mouse, and the ascites arerecovered from the mouse. The obtained monoclonal antibodies can then beprepared by purifying, for example, by ammonium sulfate precipitationthrough a Protein A or G column, by DEAE ion exchanging chromatography,or through an affinity column coupled with the transcriptional regulatorof the invention. Besides being used to purify or detect thetranscriptional regulators of the present invention, the antiobodies ofthe present invention can be applied to antibody therapy.

[0035] The present invention also relates to a screening method for acompound that binds to transcriptional regulators of the presentinvention. The screening method of the present invention includes stepsof contacting a transcriptional regulator of the present invention witha test sample and selecting a compound that has binding activity for thetranscriptional regulator of the present invention. Test samples usedfor the screening include, but are not limited to, a cell extract, asupernatant of the cell culture, a library of synthetic low molecularweight compounds, a purified protein, an expression product of a genelibrary, and a library of synthetic peptides. Methods well known to oneskilled in the art for isolating a compound binding to a transcriptionalregulator of the present invention using the regulator are as follows. Aprotein that binds to a transcriptional regulator of the presentinvention can be screened by West-western blotting comprising steps ofgenerating a cDNA library from the cells expected to express the proteinthat binds to a transcriptional regulator of the present invention(e.g., testis tissue cell and tumor cell lines HL-60, HeLa S3, Raji, andSW480) using a phage vector (λgt11, ZAP, etc.), allowing the cDNAlibrary to express on the LB-agarose plate, fixing the expressedproteins on a filter, reacting them with the transcriptional regulatorof the present invention purified as a biotin-labeled protein or afusion protein with GST protein, and detecting plaques expressing theprotein bound to the regulator on the filter with streptavidin oranti-GST antibody (Skolnik, E. Y., Margolis, B., Mohammadi, M.,Lowenstein, E., Fisher, R., Drepps, A., Ullrich, A. and Schlessinger, J.(1991), Cloning of PI3 kinase-associated p85 utilizing a novel methodfor expression/cloning of target proteins for receptor tyrosine kinases,Cell, 65:83-90). Alternatively, the method comprises expressing in yeastcells a transcriptional regulator of the present invention which isfused with SFR or GAL4 binding region, constructing a cDNA library inwhich proteins are expressed in a fusion protein with the transcriptionactivation site of VP16 or GAL4 from the cells expected to express aprotein that binds to the transcriptional regulator of the presentinvention, introducing the CDNA library into the above-described yeastcells, isolating the cDNA derived from the library from the detectedpositive clones, and introducing and expressing it in E. coli. (If aprotein that binds to the transcriptional regulator of the presentinvention is expressed, a reporter gene is activated by the binding ofthe two proteins. The positive clones can then be identified.) Thismethod can be performed using Two-hybrid system (MATCHMAKER Two-HybridSystem, Mammalian MATCHMAKER Two-Hybrid Assay Kit, MATCHMAKER One-HybridSystem (all from Clontech); HybriZAP Two-Hybrid Vector System(Stratagene) or in accordance with Dalton, S. and Treisman R. (1992),Characterization of SAP-1, a protein recruited by serum response factorto the c-fos serum response element, Cell, 68:597-612). Another methodis to apply a culture supernatant or a cell extract from the cellssuspected to express a protein which binds to the transcriptionalregulator of the present invention onto an affinity column coupled withthe transcriptional regulator of the present invention, and purify theprotein specifically bound to the column.

[0036] Also well known to one skilled in the art are a method ofscreening molecules that bind to a transcriptional regulator of thepresent invention by reacting the immobilized transcriptional regulatorof present invention with a synthetic compound, natural substance bank,or a random phage peptide display library, and a method of screening lowmolecular weight compounds, proteins (or their genes), or peptides whichbind to a transcriptional regulator of the present invention byutilizing the high-throughput technique of combinatorial chemistry(Wrighton, N. C., Farrell, F. X., Chang, R., Kashuyap, A. K., Barbone,F. P., Mulcahy, L. S., Johnson, D. L., Barrett, R. W., Jolliffe, L. K.,Dower, W. J., Small peptides as potent mimetics of the protein hormoneerythropoietin, Science (UNITED STATES) Jul. 26, 1996, 273:458-464;Verdine, G. L., The combinatorial chemistry of nature, Nature (ENGLAND),Nov. 7, 1996, 384:11-13; Hogan, J. C. Jr., Directed combinatorialchemistry, Nature (ENGLAND), Nov. 7, 1996, 384:17-19). The compoundsthus isolated, which bind to a transcriptional regulator of the presentinvention, may be used to treat cancer or other proliferative diseases.When the compounds isolated by the screening method of the presentinvention are used as pharmaceuticals, they can be formulated by a knownpharmacological process. For example, they can be administered to apatient with pharmaceutically acceptable carriers and vehicles (e.g.,physiological saline, vegetable oil, a dispersant, a surfactant, and astabilizer). The compounds may be percutaneously, intranasally,transbronchially, intramuscularly, intravenously, or orallyadministered, depending on their properties.

[0037] The present invention also relates to DNA specificallyhybridizing with DNA coding a protein of the present invention andhaving at least 15 nucleotides. As used herein, “specificallyhybridizing” means that no cross-hybridization occurs between DNAencoding other proteins under conditions of moderate stringency. SuchDNA may be used as a probe for detecting and isolating the DNA encodingthe protein of the present invention, and as a primer for amplifying theDNA encoding the protein of the present invention.

[0038] An “isolated nucleic acid” is a nucleic acid the structure ofwhich is not identical to that of any naturally occurring nucleic acidor to that of any fragment of a naturally occurring genomic nucleic acidspanning more than three separate genes. The term therefore covers, forexample, (a) a DNA which has the sequence of part of a naturallyoccurring genomic DNA molecule but is not flanked by both of the codingsequences that flank that part of the molecule in the genome of theorganism in which it naturally occurs; (b) a nucleic acid incorporatedinto a vector or into the genomic DNA of a prokaryote or eukaryote in amanner such that the resulting molecule is not identical to anynaturally occurring vector or genomic DNA; (c) a separate molecule suchas a cDNA, a genomic fragment, a fragment produced by polymerase chainreaction (PCR), or a restriction fragment; and (d) a recombinantnucleotide sequence that is part of a hybrid gene, i.e., a gene encodinga fusion protein. Specifically excluded from this definition are nucleicacids present in mixtures of different (i) DNA molecules, (ii)transfected cells, and (iii) cell clones: e.g., as these occur in a DNAlibrary such as a cDNA or genomic DNA library.

[0039] The term “substantially pure” as used herein in reference to agiven polypeptide means that the polypeptide is substantially free fromother biological compounds, such as those in cellular material, viralmaterial, or culture medium, with which the polypeptide was associated(e.g., in the course of production by recombinant DNA techniques orbefore purification from a natural biological source). The substantiallypure polypeptide is at least 75% (e.g., at least 80, 85, 95, or 99%)pure by dry weight. Purity can be measured by any appropriate standardmethod, for example, by column chromatography, polyacrylamide gelelectrophoresis, or HPLC analysis.

[0040] A “conservative amino acid substitution” is one in which an aminoacid residue is replaced with another residue having a chemicallysimilar side chain. Families of amino acid residues having similar sidechains have been defined in the art. These families include amino acidswith basic side chains (e.g., lysine, arginine, histidine), acidic sidechains (e.g., aspartic acid, glutamic acid), uncharged polar side chains(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine,cysteine), nonpolar side chains (e.g., alanine, valine, leucine,isoleucine, proline, phenylalanine, methionine, tryptophan),beta-branched side chains (e.g., threonine, valine, isoleucine) andaromatic side chains (e.g., tyrosine, phenylalanine, tryptophan,histidine).

[0041] As used herein, “percent identity” of two amino acid sequences orof two nucleic acids is determined using the algorithm of Karlin andAltschul (Proc. Natl. Acad. Sci. USA 87:2264-2268, 1990), modified as inKarlin and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1993).Such an algorithm is incorporated into the NBLAST and XBLAST programs ofAltschul et al. (J. Mol. Biol. 215:403-410, 1990). BLAST nucleotidesearches are performed with the NBLAST program, score=100, wordlength=12to obtain nucleotide sequences homologous to a nucleic acid molecules ofthe invention. BLAST protein searches are performed with the XBLASTprogram, score=50, wordlength=3 to obtain amino acid sequenceshomologous to a reference polypeptide. To obtain gapped alignments forcomparison purposes, Gapped BLAST is utilized as described in Altschulet al. (Nucleic Acids Res. 25:3389-3402, 1997). When utilizing BLAST andGapped BLAST programs, the default parameters of the respective programs(e.g., XBLAST and NBLAST) are used. See http://www.ncbi.nlm.nih.gov.

[0042] Unless otherwise defined, all technical and scientific terms usedherein have the same meaning as commonly understood by one of ordinaryskill in the art to which this invention belongs. In case of conflict,the present application, including definitions, will control. Allpublications, patent applications, patents, and other referencesmentioned herein are incorporated by reference. The materials, methods,and examples are illustrative only and not intended to be limiting.Other features and advantages of the invention will be apparent from thedetailed description, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

[0043]FIG. 1 compares the domain of BAZ (BAZ1α) with those of otherproteins. In FIG. 1A, the bromodomain of BAZ (BAZ1α) is compared withthat of TFIID from yeast, CCG1 from human, p300, and CBP. In FIG. 1B,C4HC3 Zn finger of BAZ (BAZ1α) is compared with those of U13646,retinoblastoma binding protein RBP2, two species of MOZ, p300, and CBP.The conserved amino acids, cysteine and histidine, are indicated by “*.” In both FIGS. 1A and B, identical amino acids are represented byreverse-contrast letters, and similar amino acids are represented byemphasized letters (also netted).

[0044]FIG. 2 shows a chromosome map of BAZ (BAZ1α). FIG. 2A showsassignments of chromosome 14 based on the analysis of a monochromosomehybrid cell panel using primers B (SEQ ID NO:6) and M (SEQ ID NO:7). Thenumbers 1 to Y in the figure refer to chromosome numbers, and TH refersto total human chromosomes. The 111 bp product was specificallyamplified in the cell line GM10479, monochromosomel for a humanchromosome14. FIG. 2B depicts the location of BAZ (BAZ1α) on chromosome14 as determined by Genebridge 4 radiation hybrid panel analysis.

[0045]FIG. 3 shows the expression of BAZ (BAZ1α) in normal tissues (Lane1, heart; Lane 2, brain; Lane 3, placenta; Lane 4, lung; Lane 5, liver;Lane 6, skeletal muscle; Lane 7, kidney; Lane 8, pancreas; Lane 9,spleen; Lane 10, thymus; Lane 11, prostate; Lane 12, testis; Lane 13,ovary; Lane 14, small intestine; Lane 15, colon (mucous lining); andLane 16, leukocytes in the peripheral blood). The bottom of the figureshows control bands using actin probes.

[0046]FIG. 4 shows the expression of BAZ (BAZ1α) in carcinoma. FIG. 4Adepicts the Northern blot analysis in the carcinoma cell lines (Lane 1,promyelocytic leukemia HL-60; Lane 2, HeLa S3 cells; Lane 3, chronicmyelocytic leukemia K-562; Lane 4, lymphoblastic leukemia MOLT-4; Lane5, Burkitt's lymphoma Raji; Lane 6, large intestine adenocarcinomaSW480; Lane 7, lung carcinoma A549; and Lane 8, melanoma G361). FIG. 4Bshows the RT-PCR analysis of primary lung carcinoma of Lane 10. The toppanel shows the amplified product of 554 bp from BAZ (BAZ1α) gene usingprimers U and N, and the bottom shows the amplified product of 442 bpfrom G3PDH gene using primers G3U and G3L. In the figures, L refers tonormal human lung; T, to normal human testis; and G, to normal humangenomic DNA.

[0047]FIG. 5 compares the conserved domains among BAZ (BAZ1α), BAZ2α,and U13646.

[0048]FIG. 6 shows alignments of the domain of BAZ2αand that of otherproteins. In FIG. 6A, the bromodomain of BAZ2α(BAZ2 in the figure) isaligned with PAZ (BAZ1α) human CCG1, PCAF, and CDP. In FIG. 6B, C4HC3 Znfinger of BAZ2α is aligned with BAZ (BAZlu), U13646, retinoblastomabinding protein RBP2, 2 zinc fingers of MOZ, and p300. The conservedcysteine and histidine are indicated by “* .”

[0049]FIG. 7 shows a chromosome map of BAZ2α. FIG. 7A shows assignmentsof chromosome 12 based on the analysis of a monochromosome hybrid cellpanel using primers D (SEQ ID NO:16) and E (SEQ ID NO:17). The numbers 1to Y in the figure refer to chromosome numbers. The product of 132 bpwas specifically amplified in the cell line GM10868a, a monochromosomefor human chromosome 12. FIG. 7B depicts the location of BAZ2 (BAZ2α) onchromosome 12 as determined by Genebridge 4 radiation hybrid panelanalysis.

[0050]FIG. 8 shows the expression of BAZ2α in normal tissues (A: Lane 1,heart; Lane 2, brain; Lane 3, placenta; Lane 4, lung; Lane 5, liver;Lane 6, skeletal muscle; Lane 7, kidney; Lane 8, pancreas; Lane 9,spleen; Lane 10, thymus; Lane 11, prostate; Lane 12, testis; Lane 13,uterus; Lane 14, small intestine; Lane 15, colon (mucous lining); andLane 16, leukocytes in the peripheral blood). FIG. 8B shows controlsusing actin probe.

[0051]FIG. 9 compares the conserved domains of BAZ2β BAZ1α, BAZ2α,U13646, and BAZ1βS. Each has at least five domains. BAZ1α lacks domainI. The figures on the bars of BAZ2β, BAZ1α, U13646, and BAZ1βS representthe percentage of the homology with BAZ2α. The values in the LH domainrepresent the percentage of the homology with leucine residues. Blackbands in the LH domain indicate where the LXXLL motif is present in allthree BAZ genes. LH, ZF, and BD represent leucine-rich helix domain,C4HC3 zinc finger, and bromodomain, respectively.

[0052]FIG. 10 compares the amino acid sequence of LH domain in BAZ2βwith those of corresponding domains in other proteins. The positions ofconserved leucine residues are indicated by arrows on the sequences.LXXLL motifs are boxed.

[0053]FIG. 11 shows a chromosome map of BAZ2β. The product of 147 bp wasspecifically amplified in the cell line as a monochromosome for humanchromosome 2. This product was amplified by PCR using primers nb7n andnb7ee. The numbers 1 to Y in the figure indicate chromosome numbers. Thelocation of BAZ2β on chromosome 2 was determined by Genebridge 4radiation hybrid panel analysis.

[0054]FIG. 12 shows an analysis of the expression of BAZ2β in normaltissues, carcinoma cell lines, and fetal tissues (Lane 1, heart; Lane 2,brain; Lane 3, placenta; Lane 4, lung; Lane 5, liver; Lane 6, skeletalmuscle; Lane 7, kidney; Lane 8, pancreas; Lane 9, spleen; Lane 10,thymus; Lane 11, prostate; Lane 12, testis; Lane 13, uterus; Lane 14,small intestine; Lane 15, colon (mucous lining); Lane 16, leukocytes inthe peripheral blood; Lane 17, fetal brain; Lane 18, fetal lung; Lane19, fetal liver; Lane 20, fetal kidney; Lane 21, acute leukemia HL-60;Lane 22, HeLa S3 cells; Lane 23, chronic myelocytic leukemia K-562; Lane24, lymphoblastic leukemia MOLT-4; Lane 25, Burkitt's lymphoma Raji;Lane 26, large intestine adenocarcinoma SW480; Lane 27, lung carcinomaA549; and Lane 28, melanoma G361). The sizes of the transcripts areindicated on the right side of the figure.

[0055]FIG. 13 shows the alignments of variable portions of BAZ1βS andBAZ1βL.

[0056]FIG. 14 shows the alignments of N terminal amino acid sequencesfrom BAZ1βS and three other members of the BAZ family. The residues with50% or more sequence homology are indicated by dark shadowed boxes, andthose with 50% or more sequence similarity, by light shadowed boxes.Conserved LXXLL motifs and C4HC3 zinc fingers are indicated on thealignments. Conserved leucine residues in the surrounding region of theLXXLL motif are indicated. The location of a bromodomain motif isindicated by a black line on the alignments.

[0057]FIG. 15 shows the alignments of the amino acid sequences fromBAZ1βS and three other members of the BAZ family (continued from FIG.14).

[0058]FIG. 16 shows the alignments of the amino acid sequences fromBAZ1βS and three other members of the BAZ family (continued from FIG.15.

[0059]FIG. 17 shows the alignments of the amino acid sequences fromBAZ1βS and three other members of the BAZ family (continued from FIG.16).

[0060]FIG. 18 shows the alignments of the amino acid sequences fromBAZ1βS and three other members of the BAZ family (continued from FIG.17).

[0061]FIG. 19 shows a chromosome map of BAZ1β. FIG. 19A shows mapping ofBAZ1β on chromosome seven by monochromosome hybrid cell line panelanalysis. A product of 156 bp was observed to be amplified in the cellline GM10791 by PCR using primers nb3S and nb3T. The numbers 1 to Y inthe figure indicate chromosome numbers. FIG. 19B shows the location ofBAZ1β on chromosome seven as determined by Genebridge 4 radiation hybridpanel analysis. BAZ1β is located between 7q11-21 markers D7S489 andD7S669.

[0062]FIG. 20 shows the expression analysis of BAZ1β in normal tissues.In FIG. 20A, the BAZ1β probe is hybridized with two transcripts in awide range of tissues (Lane 1, heart; Lane 2, brain; Lane 3, placenta;Lane 4, lung; Lane 5, liver; Lane 6, skeletal muscle; Lane 7, kidney;Lane 8, pancreas; Lane 9, spleen; Lane 10, thymus; Lane 11, prostate;Lane 12, testis; Lane 13, uterus; Lane 14, small intestine; Lane 15,colon (mucous lining); and Lane 16, leukocytes in the peripheral blood).FIG. 20B shows controls using an actin probe. In FIG. 20B, the blot inFIG. 20A was used to rehybridize with the actin probe.

[0063]FIG. 21 shows the regions within BAZ2β which are covered byexpression clones. Conserved domains (shadowed boxes) and LXXLL motifs(black lines) are indicated. Positions of the first and the last aminoacids of each domain are indicated on the bar. Clone 1 covers aminoacids 1-190; clone 9, amino acids 1241-1584; and clone 11, amino acids1500-1970.

[0064]FIG. 22 is a photograph of electrophoretic patterns showingSDS-PAGE analysis of GST protein (Lane 1, cell lysate (BAZ2β.1); Lane 2,flow through fraction (BAZ2β.1); Lane 3, purified fusion protein(BAZ2β.1); Lane 4, cell lysate (BAZ2β.11); Lane 5, cell lysate(BAZ2β.9); Lane 6, flow through fraction (BAZ2β.11 ); Lane 7, flowthrough fraction (BAZ2β.9); Lane 8, purified protein (BAZ2β.11); andLane 9, purified protein (BAZ2β.9). The positions of molecular weightmarkers are indicated on the right (kDa).

[0065]FIG. 23 is a photograph of electrophoresis showing Westernanalysis of purified GST-fusion protein (Lane 1, GST; Lane 2,GST-BAZ2β.1; Lane 3, GST-BAZ2β.9; and Lane 4, GST-BAZ2β.11).

BEST MODE FOR IMPLEMENTING THE INVENTION

[0066] The present invention is further illustrated with reference tothe following examples, but is not to be construed to be limitedthereto.

EXAMPLE 1

[0067] Isolation and Analysis of BAZ Gene

[0068] (1) Identification of Novel Genes Each Containing a Bromodomain

[0069] The EST database was searched by means of BLAST using the DNAsequence that encodes the 5′ bromodomain motif of the RING3 gene (SEQ IDNO:3) (Beck, S. et al., (1992), DNA Sequence, 2:203-210), and a numberof ESTs identical to the probe sequence were retrieved. The followingexperiment was then performed for one of those EST, H70181. H70181 hasthe highest homology to transcription activator GCN5 of yeast(Georgakopoulos, T. and Thireos, G. (1992), EMBO, J., 11(11):4145-4152)or human (Candau, R. et al., (1996), Mol. Cell. Biol., 16(2):593-602).

[0070] (2) Isolation of a Full-length Sequence

[0071] To clone a full-length sequence of EST H70181, PCR primers weredesigned; primer U, SEQ ID NO:4/AGAAAAAGACAATCTCCAGAGCA, and primer L,SEQ ID NO:5/GCTGTCATCATGTCGTACCAATTC. The specific product of 129 bpobtained from testis cDNA was amplified by RT-PCR using said primers.The amplified product was directly purified through a QIA Quick (Qiagen)purification column. This PCR product was used as a probe for screeningthe testis cDNA library (Clontech; HL3023a). The probe was labeled with[α-³²P] dCTP by random priming, and purified using a Chromaspin 10Column (Clontech); the cDNA clone obtained was used to re-screen thelibrary. This process was repeated until a series of overlapped cloneswas obtained, and thus a full-length sequence was obtained. The isolatedsequence was 5,934 bp in total. The isolated gene was designated “BAZ”(Bromodomain, Atypical Zinc finger). BAZ has an open reading frame (ORF)(SEQ ID NO:1) coding for 1,674 amino acids from the nucleotide positions125-5147. The ORF is followed by 787 bp of a 3′ untranslated region andterminated with a poly-A tail. The polyadenylation signal (AATAAA) islocated at 21 bp upstream from the poly-A tail. The nucleotide sequencetogether with the deduceded amino acid sequence therefrom is shown inSEQ ID NO:2.

[0072] The filter screening of the library was performed in ExpressHybhybridization solution (Clontech) at 65° C. for 1 hour. The filter wasthen washed until a final stringency of 1×SSC and 0.1% SDS at 65° C. wasattained. All the sequencing was performed on automated sequencingapparatus ABI 377 (Perkin Elmer, Norwalk, Conn.), utilizing ABI dyeterminator chemistry.

[0073] (3) Identification of the Homology and Motifs Characterizing theTranscription Factors

[0074] A protein database search using the amino acid sequence of BAZrevealed that a protein encoded by a continuous 2.2 Mb gene sequence ofthe chromosome III of C. Eelegans (Wilson, R. et al., (1994), Nature,368:32-38) is most similar (46% similarity and 23% identity). The sameregions having similarity were found in various transcription factorssuch as the 250 KD subunit of TFIID (Ruppert, S., Wang, E., and Tjian,R. (1993), Nature 362:175-179) and p300/CBP (Eckner, R. et al., (1994),Genes Dev., 8(8):869-884; Chrivia, J. C. et al., (1993), Nature,365:855-859). A motif search of amino acid sequences in the PROSITEdatabase on DNASIS (HITACHI Software Engineering Co.) identified asingle bromodomain (amino acid residues 1569-1627 of SEQ ID NO:1). Thesequence of this bromodomain, together with those of other bromodomains,is shown in FIG. 1A. A BLAST search using C4HC3 Zn finger (C4HC3ZF),which is the motif conserved among a great variety of proteins such asU13646, identified retinoblastoma binding protein RBP2 (Fattaey, A. R.et al., (1993), Oncogene, 8:3149-3156), MOZ (Borrow, J. et al., (1996),Nature Genet., 14:33-41), and p300/CBP(Koken, M. H. et al., (1995), CRAcad. Sci. III, 318:733-739), a motif of 45 amino acids (correspondingto amino acid residues 1269-1313 of SEQ ID NO:1). C4HC3ZFs present inthese genes are shown in FIG. 1B. The function of BAZ as atranscriptional regulator is implied by the fact that it is similar toseveral transcriptional regulators, in particular, a bromodomain motifconserved together with C4HC3ZF and p300/CBP. The similarity of BAZ top300/CBP is not limited to C4HC3 zinc finger and bromodomain regions;well conserved regions are also found adjacent to the bromodomain.Homology was not found between BAZ and histone-acetyltransferase domain,and between BAZ and other domains in which p300/CBP is present. However,BAZ potentially has HAT activity since the histone-acetyltransferasedomain is not well conserved among proteins.

[0075] Several sorts of sequence motifs characterized by the nuclearproteins were identified at 11 sites by employing the PSORT program(http://psort.nibb.ac.jp) utilizing a wide variety of conserved nuclearlocalization sequences.

[0076] (4) Mapping of BAZ

[0077] Primers B SEQ ID NO:6/AACACAAGTGAAGCAAAAGCTGGA and M SEQ IDNO:7/GTGGTGTGCTAACTTGGTCCACAT (obtained from the 3′ end of the gene)were used to amplify DNA obtained from each of the 24 monochromosomes ofhuman/rodent somatic cell lines available from Coriell CellRespositories, New Jersey (Dubois, B. L. and Naylor, S. (1993),Genomics, 16:315-319). The expected product of 111 bp was amplified onlyfrom GM10479, a monochromosomal cell line for human chromosome 14 (seeFIG. 2A). Primers B (SEQ ID NO:6) and M (SEQ ID NO:7) were subsequentlyused for PCR onto a GeneBridge 4 radiation hybrid panel (ResearchGenetics, Huntsville, Ala.). The binary codes generated by assessingwhether each hybrid is positive or negative for amplification werecompared with the analogous codes for the markers constituting aframework map, using the server located athttp://www-genome.wi.mit.edu./cgi-bin/contig/rhmapper.pl. This step wasrepeated using primers W: SEQ ID NO:8/CCCATCGTGAGTCAAGAGTGTCTGT and X:SEQ ID NO:9/CTCGCTTCTACCTTTTTATTGGCT (from the 5′ end of the gene).Based on the pattern obtained from this panel by identifying theamplification in the panel, BAZ was proved to be located on the 14qbetween the two markers D14S730 and D14S75 (see FIG. 2B).

[0078] (5) Analysis of BAZ Expression in the Normal Tissues

[0079] The probe as synthesized in Example 1(2) by amplifying cDNAs fromtestis with PCR using the above-described primers U (SEQ ID NO:4) and L(SEQ ID NO:5) was used for Northern blot analysis of 16 panels of normaltissues. The probe hybridized with a single species of mRNA of 7.0 kb,which corresponds to the length of the ORF identified from the sequenceof the gene. Though this transcript was expressed in almost all thetissues at very low levels, it was expressed in testis at a relativelyhigh level (see FIG. 3). The transcript was not detectable in brain,lung, liver, kidney, and colon (it was possibly expressed atnon-detectable levels). A slightly smaller transcript of 6.5 kb was alsodetected only in testis at a low level. Since the cells divide morevigorously in the testis than any other tissue examined, the expressionpattern is thought to correspond to the role for BAZ during activeproliferation. Hybridization for Northern blot analysis was performed inExpress Hyb hybridization solution (Clontech) at 65° C. for 1 hour. Thefilters were washed until the final stringency reached 1×SSC and 0.1%SDS at 65° C. Imaging was performed using a Fuji BAS Image Analyzer(Fuji Photo Film).

[0080] (6) Analysis of BAZ Expression in Tumor

[0081] That BAZ is highly expressed in the testis suggests thepossibility of its high level expression in vigorously proliferatingtumors. Thus, Northern analysis of eight panels of tumor cell lines wascarried out, using the same probe as used in Example 1(5). As a result,the transcript of 7.0 kb alone was hybridized with the probe as when thenormal tissue was used. Compared with most normal tissues, however, thetranscriptional levels are remarkably higher in most of the tumor celllines (see FIG. 4A). Specifically, the expression was higher in thetumor cell lines HL-60, HeLa S3, Raji, and SW480. In contrast, theexpression levels in K-562, MOLT-4, A549, and G361 are almost the sameas those in normal tissues.

[0082] RT-PCR was used to examine the expression of BAZ in the primarylung carcinomas as shown in Table 1. TABLE 1 Sample Patients No. age sexCarcinoma 1 49 male papillary adenocarcinoma 2 63 male papillaryadenocarcinoma (moderately differentiated) 3 60 male papillaryadenocarcinoma (poorly differentiated) 4 70 male squamous cell carcinoma(fusiform cell variant) 5 76 male papillary adenocarcinoma 6 65 malelarge lung cell carcinoma (moderately differntiated) 7 77 male squamouscell carcinoma (poorly differentiated) 8 45 male acinic adenocarcinoma 950 male carcinoid tumor 10 66 male choriocarcinoma

[0083] Each of the 10 samples was amplified using the primers G3U SEQ IDNO:10/TCATCATCTCTGCCCCCTCTGTCTG and G3L SEQ IDNO:11/GACGCCTGCTTCACCACCTTCTTG, which are the primers for amplifying 442bp of a house-keeping gene G3PDH, and the primers U, SEQ ID NO:4 and N,SEQ ID NO:12/TCATGTGGTCAATCAATTGTTTGT, which are primers for BAZ (seeFIG. 4). G3PDH was used to determine that an equal amount of mRNA waspresent in each sample.

[0084] The primers for BAZ were selected to specifically amplify thecDNA but not genomic DNA. The amplified product was definitely detectedin the sample from the testis and the two lung tumors, but not from theother eight samples from the lung tumor or the normal lungs.

[0085] RT-PCR was performed according to a standard technique in whichtotal RNA was extracted according to the AGPC method (Chomczynski, P.and Sacchi, N. (1987), Analytical Biochem., 162:156-159), thensingle-stranded DNA was synthesised with an oligo (dT15) primer and MMLVreverse transcriptase, a part of which was used for the RT-PCR. TheRT-PCR was performed using AmpliTaq gold (ABI), with 27 cycles ofannealing at 60° C. to amplify G3PDH and 33 cycles of annealing at 55°C. to amplify BAZ. The conditions for hybridization and imaging were thesame as in Example 1(5).

EXAMPLE 2

[0086] Isolation and Analysis of BAZ2α Gene

[0087] (1) Identification of a Novel Gene Containing a Bromodomain andIsolation of its Full-length Sequence

[0088] The DNA encoding the bromodomain of BAZ is highly homologous tothat of GCN5. The DNA sequence encoding the bromodomain motif of humanGCN5 gene (Candau, R. et al., (1996), Mol. Cell. Biol., 16 Q):593-602)was used to search the EST database using BLAST. The Motif search wasperformed using PROSITE. Proteins were compared using Bestfit in GCG.The nuclear transport signal was identified using PSORT. As a result, anumber of ESTs were found to be identical to the probe sequence. Amongthem, an EST (Accession Number: N76552) obtained from a fetalliver/spleen cDNA library proved to be a novel gene.

[0089] To start cloning the full-length sequence of EST N76552, PCRprimers were designed to amplify a particular product of 91 bp from thetestis cDNA library; primer NB16U (SEQ IDNO:15/TGACTCTGAAGTAGGCAAGGCTGG) and primer NB16L (SEQ IDNO:16/CTTGCCTCACAGATTGGCCTGT). The PCR product was used as a probe toscreen the testis cDNA library (Clontech; HL3023a). The amplifiedproduct was directly purified through a QIA Quick (Qiagen) purificationcolumn. The cDNA clone having sequences corresponding to EST was used tore-screen the library.

[0090] This process was repeated until a series of overlapped cloneshaving a full-length sequence of the complete coding region wasobtained. All the sequencing was performed with automated sequencingapparatus ABI 377 (Perkin Elmer, Norwalk, Conn.), utilizing ABI dyeterminator chemistry. As a result, a continuous sequence consisting of9,408 bp nucleotides in total size was generated. Theoreticaltranslation of this sequence showed a presence of methionine codon atthe nucleotide position of 740. An open reading frame (ORF) coding 1878amino acids starts from this position and terminates at the nucleotideposition 6373. The ORF is followed by a 3′ untranslated regionconsisting of at least a 3 kb nucleotide sequence. The nucleotidesequence of the cDNA obtained is shown in SEQ ID NO:14, and the aminoacid sequence deduced from the cDNA is shown in SEQ ID NO:13. Theisolated clone was designated BAZ2α.

[0091] (2) Identification of the Homology and Motifs Characteristic ofTranscription Factors

[0092] Like BAZ, BAZ2α was shown to have the highest homology with theprotein encoded by a C. elegans bromodomain gene U13646, the geneforming a part of a continuous 2.2 Mb segment of chromosome III of C.elegans (Wilson, R. et al., (1994), Nature, 368:32-38) by searching theprotein databases with the amino acid sequence of BAZ2α. The regionswhich showed similarity were identified using various transcriptionfactors such as the 250 KD subunit of TFIID (Ruppert, S., Wang, E. andTjian, R. (1993), Nature 362:175-179) and p300/CBP (Eckner, R. et al.,(1994), Genes Dev., 8(8):869-884; Chrivia, J. C. et al., (1993), Nature,365:855-859). The bromodomain was located between the amino acidresidues 1788 and 1846. The alignments of BAZ2α, BAZ, and U13646 areshown in FIG. 5. The alignments of the sequence of the BAZ2u bromodomainand those of other bromodomains are shown in FIG. 6A. Moreover, a singlemotif consisting of 45 amino acids (amino acid residues 1652-1696) wasidentified. This motif codes C4HC3 Zinc finger (C4HC3ZF), a motifconserved among a large number of proteins such as BAZ, U13646,retinoblastoma binding protein RBP2 (Fattaey, A. R. et al., (1993),Oncogene, 8:3149-3156), MOZ (Borrow, J. et al., (1996) Nature Genet.,14:33-41), and p300/CBP (Koken, M. H. et al., (1995) CR, 4 cad. Sci.III, 318:733-739) by BLAST searching. The alignments of C4HC3ZF fromthese genes are shown in FIG. 6B. BAZ2α resembles BAZ, which suggeststhe possibility that the two proteins closely relate and form a part ofa protein family having a similar function. Like BAZ, BAZ2α whollyresembles several transcription factors and has C4HC3ZF and bromodomainmotifs conserved among p300/CBP and TIF1, especially indicating that BAZis likely to function as a transcriptional regulator

[0093] An LXXLL motif, which is believed to be required for mediatingtranscription induced by nuclear receptors (Torchia, J. et al., (1997),Nature, 387:677-684; Herry, D. M. et al., (1997), Nature 387:733-736),is located at amino acid residue 872. PROSITE motif searching revealedthat this motif was located at the 3′ end of the leucine zipper (aminoacid residues 852-873). The relative locations of LXXLL, C4HC3, andbromodomain motifs in BAZ2α are remarkably similar to those of U13646and BAZ (FIG. 5). Furthermore, in either case, the LXXLL motif islocated behind the helix structure characterized by conserved lysineresidues existing at regular intervals.

[0094] (3) Mapping of BAZ2α

[0095] To locate BAZ2α on the chromosome, PCR primers D (SEQ IDNO:17/TTGCCGTATTTGGCTGGTATC) and E (SEQ IDNO:18/CATAGAGAAGAGGGCAGGGTTGA), which amplify a fragment of 132 bp, wereused to amplify the DNA from each of the 24 monochromosomes ofhuman/rodent somatic cell lines (Dubois, B. L. and Naylor, S. (1993),Genomics, 16:315-319) obtained from Coriell Cell Respositories (NewJersey). The BAZ2α-containing region was identified using 91 GeneBridge4 radiation hybrid panels (Walter M. A. et al., (1994), Nature Genetics,7:22-28). These panels were screened by PCR using primers D and E again.The binary codes generated by assessing whether each hybrid is positiveor negative for amplification were compared with the analogous codes forthe markers constituting a framework map, using the server located athttp://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl. BAZ2α was thusproved to be located 12q24.3-ter from D12S367(see FIG. 7B).

[0096] (4) Analysis of the BAZ2α Expression

[0097] The probe (a 481 bp fragment of BAZ2α gene) prepared byamplifying the DNA from one of the clones obtained from the testis cDNAlibrary (Clontech)in Example 2(1) using primers gt10F (SEQ IDNO:19/CTTTTGAGCAAGTTCAGCCT) and NB16N (SEQ IDNO:20/GTCGGCTTCTTCATTTCCTCCA) was used for Northern analysis of 16panels of normal tissues (Clontech). The probe was labeled with [α-³²P]dCTP by random priming and purified using a Chromaspin 10 column(Clontech). Hybridization for Northern analysis and library filterscreening were performed in the ExpressHyb hybridization solution(Clontech) at 65° C. for 1 hour. The filters were then washed until thefinal stringency reached 0.5×SSC and 0.1% SDS. Imaging was performedusing a Fuji BAS Image Analyzer. The result showed that this probe washybridized with a single species of mRNA of 10.5 kb in almost all thetissues; this length corresponds to that of ORF identified from the genesequence (FIG. 8). The transcript was expressed in almost all thetissues at a low level. Another transcript of 9.0 kb was detected andwas primarily expressed in the testis. This band survived thehigh-stringent wash. The second transcript is thought to be analternatively spliced form of, or a different gene closely related to,BAZ2α.

EXAMPLE 3

[0098] Isolation and Analysis of BAZ2α Gene (1) Identification of aNovel Gene Containing a Bromodomain and Isolation of its Full-lengthSequence

[0099] A BLAST search was performed against the EST databases using thevarious nucleotide sequences containing a known bromodomain motif.Several ESTs which may encode the bromodomain gene were identified basedon the result of the search using the nucleotide sequence of the SMAPgene (Nielsen, M. S. et al., (1996), Biochem. Biophys. Acta). Amongthem, an EST (Gnbank Accession Number: AA015589) obtained from a retinalcDNA library was proved to be a novel gene, the protein deduced fromwhich has the highest homology with BAZ2α.

[0100] Its full-length nucleotide sequence was isolated. The full-lengthgene for EST AA015589 was cloned as follows. First, PCR primers NB7U(SEQ ID NO:23/CTGACTGAAATGGAAACTCATGAGG) and NB7L (SEQ IDNO:24/CTAGAGCAAAGGTTTCAAGGTTTGG) were designed to obtain the specificproduct of 165 bp from the testis cDNA. The amplified product wasdirectly purified with a QIA Quick (Qiagen) purification column. The PCRproduct was used as a probe to screen the testis cDNA library, and thecDNA clone containing the EST sequence was used to re-screen thelibrary. This process was repeated until the nucleotide sequencecovering the whole coding region of the gene was obtained by assemblingthe clones. As a result, a nucleotide sequence consisting of 7,585 bp intotal was obtained. This full-length sequence contains an open readingframe (ORF) consisting of 1972 amino acids (6,282 nucleotides) with ATGat the nucleotide position 367 as the initiation codon, followed by 3′UTR of 1303 bp. SEQ ID NO:22 shows the nucleotide sequence of the cDNAthus obtained, and SEQ ID NO:21 shows the amino acid sequence deducedfrom the nucleotide sequence. All the sequencing was performed onautomated sequencing apparatus ABI 377 (Perkin Elmer, Norwalk, Conn.),utilizing ABI dye terminator chemistry. Hybridization for the libraryfilter screening was performed using ExpressHyb hybridization solution(Clontech) at 65° C. for 1 hour. The filters were washed until the finalstringency reached 0.5×SSC and 0.1% SDS. Subsequently, the filters weresubjected to autoradiography at 70° C. for 1 to 3 days to intensifytheir signals.

[0101] (2) Homology and the Characteristics of the Motifs of theTranscriptional Regulator

[0102] The motifs of the protein encoded by the gene obtained weresearched for in PROSITE. The proteins were compared using Bestfit fromGCG. A nuclear transport signal was identified through PSORT(http://psort.nibb.ac.jp/form.html).

[0103] As for BAZ and BAZ2α, a database search based on the amino acidsequences predicted from the registered genes showed that this gene hasthe greatest similarity to the protein encoded by the bromodomain geneU13646 from the nematode (C. elegans). This nematode bromodomaingenecorresponds to a portion of a 2.2 Mb segment derived from chromosome IIIof the nematode (C. elegans) (Wilson, R. et al., (1994), Nature,368:32-38). The gene, however, shows homology to BAZ and BAZ2α to alarger extent. Actually, the similarity of the protein encoded by thisgene to BAZ and BAZ2α suggests the possibility that these three proteinsare closely related to one another, and, moreover, that they are a partof a broader family of proteins with similar functions. This gene wasdesignated BAZ2β (for bromodomain, atypical zinc finger), since it hasthe greatest association with BAZ2α. BAZ was also renamed BAZ1α. Theamino acid sequence of BAZ2β is shown in FIG. 9 together with those ofBAZ1α, BAZ2α, U13646, and BAZ1βS described below. At least five regionsor domains can be identified from the sequences. The first domain (I) isnot present in BAZ1α, but is in the other three proteins. The existenceof a leucine-rich helical structure (LH) was predicted from the analysisof the next domain. LXXLL motif is present at the central part of thisdomain on all BAZs except U13646. This motif potentially confers theinteraction with the nuclear receptors on the protein (Torchia, J. etal., (1997), Nature, 387:677-684; Heery, D. M. et al., (1997), Nature,387:733-736). Both domains II and III are highly conserved, suggestingtheir functional importance. Each protein has a highly conserved C4HC3zinc finger (Aasland, R. et al., (1995), Trends Biochem. Sci., 20:56-59;Koken, M. H. et al., (1995), CR Acad. Sci. III, 318:733-739; Saha, V. etal.,, (1995), Proc. Natl. Acad. Sci., 92:9737-9741) and a bromodomain.In addition, a conserved region is found upstream from the zinc-fingermotif, and the region can also be functionally important. Similarly,there are conserved sequences upstream from the bromodomain motif. Suchconserved domains are aligned in FIG. 10. Like BAZ1α and BAZ2α, BAZ2βexhibits great similarity to several transcription factors and is thusexpected to function as a transcription factor. Consistent with thisfunction, estimation of the protein localization in the cell using thePSORT program revealed that BAZ2β has 19 consensus nuclear localizationsequences (Robbins, J. et al., (1991), Cell, 64:615-23) in total.

[0104] (3) Chromosomal Mapping of BAZ2α

[0105] To create a chromosome map of BAZ2α, primers nb7n (SEQ IDNO:25/TGTTGCTGCATCACTTGTGTAGTT) and NB7ee (SEQ IDNO:26/GGCATGACAATAATGTCTGCAAA) were prepared and used toamplify the DNAobtained from each of the 24 human/rodent monochromosomal somatic celllines (Dubois, B. L. and Naylor, S. (1993), Genomics, 16:315-319). Theamplification of the 147 bp fragment as expected PCR product indicatedthat the gene was likely to be located on human monochromosome 2 (FIG.11). The locus region of BAZ2β was determined by use of 91 radiationhybrid panels of GeneBridge 4 (Walter, M. A. et al., (1994), NatureGenetics, 7:22-28). The hybrid panels were screened by PCR using primersnb7n and nb7ee again. The binary codes generated by assessing whethereach hybrid is positive or negative for the amplification were comparedwith the analogous codes for the markers constituting a framework map,using the server located athttp://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl to identify thechromosomal locus of this gene. As a result, BAZ2β was confirmed to belocated on chromosome 2q23-24 and between markers D2S1986 and G09369(FIG. 11).

[0106] (4) Analysis of the BAZ2β Expression

[0107] The cDNA probe containing the sequence correponding to nucleotideresidues 1700-4000 was used for Northern analysis of 16 normal tissues,eight tumor cell lines, and four fetal tissues (FIG. 12). The probe waslabeled with [α-³²P] dCTP by random priming and purified on a Chromaspin10 column (Clontech). Hybridization for Northern analysis was performedat 65° C. for 1 hour using ExpressHyb hybridization solution (Clontech).Subsequently, the filters were washed at 65° C. until the finalstringency reached 0.5×SSC and 0.1% SDS. Autoradiography was thenperformed at −70° C. for 1 to 3 days to intensify the signals of thefilters. This probe detected an mRNA of about 9.5 kb, a transcript whosesize agreed with that of the ORF identified from the nucleotidesequence, in almost all the tissues examined. Besides this band, atranscript of about 6.5 kb was predominantly expressed in the testis.Since this band remained unchanged even after the high stringent wash(0.1×SSC, at 65° C.), it was considered to be specifically expressed.The second transcript could be an alternatively spliced product ofBAZ2β), but no clone implying this event was found. It was also likelythat expression of another gene closely related to BAZ2β was detected.In addition to these transcripts, several mRNAs were detected in mosttissues. Such transcripts were considered to be derived from other geneseach having an analogous sequence. Another analysis using another probecontaining a bromodomain revealed the expression of the transcript of6.5 kb only in the testis and of a 8.5 kb transcript in a wide range oftissues.

EXAMPLE 4

[0108] Isolation and Analysis of BAZ1β (BAZ1βS and BAZ1βL) Genes

[0109] (1) Identification of Novel Genes Containing a Bromodomain andIsolation of their Full-length Nucleotide Sequences

[0110] A BLAST search was performed against the EST database using thenucleotide sequence of the bromodomain motif from human GCN5 gene(Candau et al., (1996), Mol. Cell. Biol., 16:593-602). Several ESTspossibly coding a number of bromodomain genes were identified. Amongthem, an EST (Gnbank accession Number: AA01307) derived from a retinalcDNA library was found to be a novel gene.

[0111] Its full-length sequence was isolated. The full-length gene forEST AA01307 was cloned as follows. First, PCR primers nb3U (SEQ IDNO:31/TGGATGATGCTGAGGTGGATGA) and nb3L (SEQ IDNO:24/GGGGTGCTGGATGACATCATAG) were designed to obtain a product of 184bp specific to the primers from a testis cDNA library. The amplifiedproduct was directly purified using a QIA Quick (Qiagen) purificationcolumn. The PCR product was used as a probe to screen the testis cDNAlibrary (Clontech HL3024a), and the cDNA clone containing the ESTsequence was used to re-screen the library. This process was repeatedafter joining the clones. As a result, two types of nucleotide sequenceswere obtained and designated BAZ1β. The two sequences were furtherdesignated BAZ1βS for the shorter sequence and BAZ1βL for the longerone. The shorter sequence consisted of 5,561 nucleotides and encoded aprotein of 1527 amino acids; the longer sequence consisted of 5,573nucleotides and encoded a protein of 1531 amino acids, containing atandem repeat of TACAGACCCTCC in one frame. This repeat gave rise to aninsertion of four amino acids LLQT at position 658, which interestinglyresulted in an additional LXXLL motif. BAZ1βS had four LXXLL motifsinitiated at positions 655, 658, 1000, and 1436, while BAZ1βL had fiveLXXLL motifs initiated at positions 655, 658, 663, 1004, and 1440. FIG.13 shows an alignment of the portions having multiple LXXLL motifs ofBAZ162 S and BAZ1βL.

[0112] To determine whether the variability of the LXXLL motif isattributed to alteration of splicing or polymorphism, a pair of primersconsisting of NB3KK (SEQ ID NO:33/GAGTGCAGATAAGGGTGGCTTTTT) and NB3LL(SEQ ID NO:34/CCAATTCACCATAGTCTTCGGCTA), which correspond to both sidesof the variable region, was prepared and used to amplify genomic DNA andcDNA. As a result, these primers amplified a product of the same sizefrom both of the templates. This implies the sequence variant isgenerated within an intron. Therefore, the variation of the sequence isprobably caused by polymorphism. This may affect the interaction withthe nuclear receptors. The nucleotide sequence of BAZ1βS cDNA thusobtained is shown in SEQ ID NO:28, and the deduced amino acid sequenceof the protein encoded by the cDNA is shown in SEQ ID NO:27. Thenucleotide sequence of BAZ1βL cDNA is also shown in SEQ ID NO:30, andthe deduced amino acid sequence of the protein encoded by the cDNA isshown in SEQ ID NO:29. All the nucleotide sequences were determined withautomated sequencing apparatus ABI 377, using ABI dye terminatorchemistry. Hybridization for the filter screening of the library wasperformed in ExpressHyb hybridization solution (Clontech) at 65° C. for1 hour. The filters were washed at 65° C. until the final stringencyreached 0.5×SSC and 0.1% SDS. Subsequently, the filters wereautoradiographed at −70° C. for 4 days to intensify the signals orautoradiographed for 4 hours with the Fuji BAS system.

[0113] (2) Homology and Characteristics of the Motifs of theTranscriptional Regulator

[0114] The motifs of the proteins encoded by BAZ1βS and BAZ1βL geneswere searched in PROSITE. The proteins were compared using a MAP programof a BCM search launcher(http://dot.imgen.bcm.tmc.edu:9331/multi-align/multi-align.html) underthe default setting conditions; the output results were edited using abox shade program (http://ulrec3.unil.ch/software/BOX_form.htmlat). Anuclear transport signal was identified through PSORT(http://psort.nibb.ac.jp/form.html).

[0115] Several motifs characteristic of transcriptional regulators werefound in both BAZ1βS and BAZ1βL. They were bromodomain, C4HC3 zincfinger (C4HC3ZF), and LXXLL motifs. LLXXLL motifs were present in theleucine-rich domain conserved among other BAZ family member proteingenes and U13646 (FIG. 9). Although the importance of this domain hasnot been clarified, it can form a leucine zipper responsible for forminga dimer of the protein. It has been reported that such motifs arecommonly found in the transcriptional regulators of eukaryotes (Buschand Sassone-Corsi, 1990) and that LXXLL motifs also interact with thenuclear receptors (Torchia et al., (1997), Nature, 387:677-684; Heery etal., (1997), Nature 387:733-736). That the predicted amino acidsequences have extensive similarity to several kinds of transcriptionregulators indicates the possibility that their genes function astranscriptional regulators. This is further supported by the fact that13 nuclear localized consensus sequences (Robbins et al., (1991), Cell,64:615-23 ) were found in total based on the prediction of the cellularlocalization of the proteins using the PSORT program. The predictedamino acid sequences exhibited the highest similarity to BAZlO. Theyalso showed similarity to the proteins encoded by BAZ2α, BAZ2β, and C.elegans bromodomain gene U13646. Among the six domains, the first domainexisted in BAZ2α, BAZ2β, and U13646, but not in BAZ1βS, BAZ1βL, orBAZ1β. Comparing the whole structures of these gene products, the regionbetween domains II and III is the most similar to that of BAZlu (FIGS.14-18). Like other members of BAZ family, these gene products also havemotifs that are present in the protein assumed to be encoded by nematode(C. elegans) bromodomain gene U13646 (Wilson et al., (1994) Nature,368:32-38) that is identified by analyzing genome sequences of thegenes. Alignment of the sequences of BAZlgS, other members of the BAZfamily, and U13646 reveals that the most highly conserved regions arelocated between the center and the C terminus of the sequences (FIGS.14-18). For U13646, this region is not depicted in the figures, and onlyN terminal region is aligned with that of BAZ1βS and BAZ1α.

[0116] (3) Chromosomal Mapping of BAZ1β

[0117] To create a chromosome map of BAZ1β, primers nb3S (SEQ IDNO:35/GAAACGGGAGGAGCTGAAAAAG) and nb3T (SEQ IDNO:36/CCTTCAGGGGTATCCACCAATC) were prepared and used to amplify the DNAobtained from each of the 24 human/rodent monochromosomal somatic celllines (Dubois, B. L. and Naylor, S. (1993), Genomics, 16:315-319). Theexpected PCR product of 156 bp was amplified from GM10791 from twodistinct cell lines, suggesting that the BAZ1β gene is likely to belocated on human chromosome 7 (FIG. 19A). The locus of BAZ1β wasdetermined using 91 radiation hybrid panels of GeneBridge 4 (Walter, M.A. et al., (1994), Nature Genetics, 7:22-28). The hybrid panels werescreened by performing PCR with primers nb3S and nb3T again. The locusof this gene was identified by comparing the binary codes generated byassessing each hybrid as positive or negative for the amplification withthe analogous codes for the markers constituting a framework map usingthe server located athttp://www-genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl. As a result,BAZ1β was confirmed to be mapped on chromosome 7q11-22 and also locatedbetween the markers D7S489 and D7S669 (FIG. 19B).

[0118] (4) Analysis of the BAZ1β Expression

[0119] The cDNA probe of 156 bp prepared by PCR for the testis cDNAusing primers nb3S and nb3T was used for Northern analysis of 16 panelsof normal tissues (FIG. 20). The probe was labeled with [α-³²P] dCTP byrandom priming and purified with a Chromaspin 10 column (Clontech).Hybridization for Northern analysis was performed at 65° C. for 1 hourin ExpressHyb hybridization solution (Clontech). The filters were washedat 65° C. until the final stringency reached 0.5×SSC and 0.1% SDS.Subsequently, autoradiography was performed at −70° C. for 4 days tointensify the signals of the filters or for 4 hours with a Fuji BASsystem. This probe detected an mRNA of 7.5 kb in almost all the tissuesexamined. The transcript was analogous to a 7.0 kb transcript of BAZ1α.

EXAMPLE 5

[0120] Expression and Purification of BAZ2β Fusion Protein

[0121] Three constructs for BAZ2β were prepared with pGEX vector(Pharmacia) used to express fusion proteins in bacteria. Each of thethree constructs contained the sequence corresponding to the amino acidpositions 1-190, 1241-1584, or 1500-1970 of BAZ2α (FIG. 21). Theexpression of the fusion protein was mediated by the IPTG-induciblepromoter located upstream from the cloning site. The expressed proteinswere purified through an affinity matrix containingglutathione-Sepharose beads since the expressed protein was fused toglutathione-S-transferase (GST). Specifically, the GST fusion proteinswere expressed and purified according to the instructions appended toGST purification modules (Pharmacia). The cultured volume was 400 ml,and proteins were induced by 0.1 mM IPTG at 30° C. overnight. Westernblotting was performed using BioRad reagents included in an AlkalinePhosphatase Conjugate Substrate kit, according to the manual appended tothe kit.

[0122] The results of analyzing the expressed protein on the 4-20%gradient SDS-polyacrylamide gel showed that the induced proteins werenot detected in the bacterial cell lysates before purification (FIG. 22,Lanes 1, 4, and 5), indicating that the induction through the promoterwas not strong in any construct. In any case, however, distinctiveproteins (Table 2) with molecular weights corresponding to thosepredicted were detected (FIG. 23, Lanes 3, 8, and 9). To prove that thepurified proteins were the desired fusion proteins, western blot wascarried out using the anti-GST antibody. As a result, purified proteinwith the corresponding size predicted for each protein was detected.TABLE 2 Amino acid Predicted Detected Construct region MWT kDal MWT kDalBAZ2β.1  1-190 51 50 BAZ2β.9 1241-1584 67 65 BAZ2β.11 1500-1970 84 85

Industrial Applicability

[0123] The present invention provides a novel transcriptional regulatorhaving a bromodomain, DNA coding said transcriptional regulator, avector containing said DNA, a transformant expressively retaining saidDNA, an antibody binding to said transcriptional regulator, and themethod of screening a compound binding to said transcriptionalregulator. A transcriptional regulator and DNA of the present inventionare expected to be used as indices to diagnose and treat cancer andproliferative diseases, and to screen a drug with a new actionmechanism. A compound binding to a transcriptional regulator of thepresent invention could also be used as a pharmaceutical to treat thediseases described above.

0 SEQUENCE LISTING <160> NUMBER OF SEQ ID NOS: 72 <210> SEQ ID NO 1<211> LENGTH: 1674 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400>SEQUENCE: 1 Met Glu Asp Ala Ser Glu Ser Ser Arg Gly Val Ala Pro Leu IleAsn 1 5 10 15 Asn Val Val Leu Pro Gly Ser Pro Leu Ser Leu Pro Val SerVal Thr 20 25 30 Gly Cys Lys Ser His Arg Val Ala Asn Lys Lys Val Glu AlaArg Ser 35 40 45 Glu Lys Leu Leu Pro Thr Ala Leu Pro Pro Ser Glu Pro LysVal Asp 50 55 60 Gln Lys Leu Pro Arg Ser Ser Glu Arg Arg Gly Ser Gly GlyGly Thr 65 70 75 80 Gln Phe Pro Ala Arg Ser Arg Ala Val Ala Ala Gly GluAla Ala Ala 85 90 95 Arg Gly Ala Ala Gly Pro Glu Arg Gly Ser Pro Leu GlyArg Arg Val 100 105 110 Ser Pro Arg Cys Leu Cys Ser Gly Glu Gly Gly GlnVal Ala Val Gly 115 120 125 Val Ile Ala Gly Lys Arg Gly Arg Arg Gly ArgAsp Gly Ser Arg Arg 130 135 140 Ala Pro Gly Gly Arg Glu Met Pro Leu LeuHis Arg Lys Pro Phe Val 145 150 155 160 Arg Gln Lys Pro Pro Ala Asp LeuArg Pro Asp Glu Glu Val Phe Tyr 165 170 175 Cys Lys Val Thr Asn Glu IlePhe Arg His Tyr Asp Asp Phe Phe Glu 180 185 190 Arg Thr Ile Leu Cys AsnSer Leu Val Trp Ser Cys Ala Val Thr Gly 195 200 205 Arg Pro Gly Leu ThrTyr Gln Glu Ala Leu Glu Ser Glu Lys Lys Ala 210 215 220 Arg Gln Asn LeuGln Ser Phe Pro Glu Pro Leu Ile Ile Pro Val Leu 225 230 235 240 Tyr LeuThr Ser Leu Thr His Arg Ser Arg Leu His Glu Ile Cys Asp 245 250 255 AspIle Phe Ala Tyr Val Lys Asp Arg Tyr Phe Val Glu Glu Thr Val 260 265 270Glu Val Ile Arg Asn Asn Gly Ala Arg Leu Gln Cys Thr Ile Leu Glu 275 280285 Val Leu Pro Pro Ser His Gln Asn Gly Phe Ala Asn Gly His Val Asn 290295 300 Ser Val Asp Gly Glu Thr Ile Ile Ile Ser Asp Ser Asp Asp Ser Glu305 310 315 320 Thr Gln Ser Cys Ser Phe Gln Asn Gly Lys Lys Lys Asp AlaIle Asp 325 330 335 Pro Leu Leu Phe Lys Tyr Lys Val Gln Pro Thr Lys LysGlu Leu His 340 345 350 Glu Ser Ala Ile Val Lys Ala Thr Gln Ile Ser ArgArg Lys His Leu 355 360 365 Phe Ser Arg Asp Lys Leu Lys Leu Phe Leu LysGln His Cys Glu Pro 370 375 380 Gln Glu Gly Val Ile Lys Ile Lys Ala SerSer Leu Ser Thr Tyr Lys 385 390 395 400 Ile Ala Glu Gln Asp Phe Ser TyrPhe Phe Pro Asp Asp Pro Pro Thr 405 410 415 Phe Ile Phe Ser Pro Ala AsnArg Arg Arg Gly Arg Pro Pro Lys Arg 420 425 430 Ile His Ile Ser Gln GluAsp Asn Val Ala Asn Lys Gln Thr Leu Ala 435 440 445 Ser Tyr Arg Ser LysAla Thr Lys Glu Arg Asp Lys Leu Leu Lys Gln 450 455 460 Glu Glu Met LysSer Leu Ala Phe Glu Lys Ala Lys Leu Lys Arg Glu 465 470 475 480 Lys AlaAsp Ala Leu Glu Ala Lys Lys Lys Glu Lys Glu Asp Lys Glu 485 490 495 LysLys Arg Glu Glu Leu Lys Lys Ile Val Glu Glu Glu Arg Leu Lys 500 505 510Lys Lys Glu Glu Lys Glu Arg Leu Lys Val Glu Arg Glu Lys Glu Arg 515 520525 Glu Lys Leu Arg Glu Glu Lys Arg Lys Tyr Val Glu Tyr Leu Lys Gln 530535 540 Trp Ser Lys Pro Arg Glu Asp Met Glu Cys Asp Asp Leu Lys Glu Leu545 550 555 560 Pro Glu Pro Thr Pro Val Lys Thr Arg Leu Pro Pro Glu IlePhe Gly 565 570 575 Asp Ala Leu Met Val Leu Glu Phe Leu Asn Ala Phe GlyGlu Leu Phe 580 585 590 Asp Leu Gln Asp Glu Phe Pro Asp Gly Val Thr LeuGlu Val Leu Glu 595 600 605 Glu Ala Leu Val Gly Asn Asp Ser Glu Gly ProLeu Cys Glu Leu Leu 610 615 620 Phe Phe Phe Leu Thr Ala Ile Phe Gln AlaIle Ala Glu Glu Glu Glu 625 630 635 640 Glu Val Ala Lys Glu Gln Leu ThrAsp Ala Asp Thr Lys Gly Cys Ser 645 650 655 Leu Lys Ser Leu Asp Leu AspSer Cys Thr Leu Ser Glu Ile Leu Arg 660 665 670 Leu His Ile Leu Ala SerGly Ala Asp Val Thr Ser Ala Asn Ala Lys 675 680 685 Tyr Arg Tyr Gln LysArg Gly Gly Phe Asp Ala Thr Asp Asp Ala Cys 690 695 700 Met Glu Leu ArgLeu Ser Asn Pro Ser Leu Val Lys Lys Leu Ser Ser 705 710 715 720 Thr SerVal Tyr Asp Leu Thr Pro Gly Glu Lys Met Lys Ile Leu His 725 730 735 AlaLeu Cys Gly Lys Leu Leu Thr Leu Val Ser Thr Arg Asp Phe Ile 740 745 750Glu Asp Tyr Val Asp Ile Leu Arg Gln Ala Lys Gln Glu Phe Arg Glu 755 760765 Leu Lys Ala Glu Gln His Arg Lys Glu Arg Glu Glu Ala Ala Ala Arg 770775 780 Ile Arg Lys Arg Lys Glu Glu Lys Leu Lys Glu Gln Glu Gln Lys Met785 790 795 800 Lys Glu Lys Gln Glu Lys Leu Lys Glu Asp Glu Gln Arg AsnSer Thr 805 810 815 Ala Asp Ile Ser Ile Gly Glu Glu Glu Arg Glu Asp PheAsp Thr Ser 820 825 830 Ile Glu Ser Lys Asp Thr Glu Gln Lys Glu Leu AspGln Asp Met Phe 835 840 845 Thr Glu Asp Glu Asp Asp Pro Gly Ser His LysArg Gly Arg Arg Gly 850 855 860 Lys Arg Gly Gln Asn Gly Phe Lys Glu PheThr Arg Gln Glu Gln Ile 865 870 875 880 Asn Cys Val Thr Arg Glu Leu LeuThr Ala Asp Glu Glu Glu Ala Leu 885 890 895 Lys Gln Glu His Gln Arg LysGlu Lys Glu Leu Leu Glu Lys Ile Gln 900 905 910 Ser Ala Ile Ala Cys ThrAsn Ile Phe Pro Leu Gly Arg Asp Arg Met 915 920 925 Tyr Arg Arg Tyr TrpIle Phe Pro Ser Ile Pro Gly Leu Phe Ile Glu 930 935 940 Glu Asp Tyr SerGly Leu Thr Glu Asp Met Leu Leu Pro Arg Pro Ser 945 950 955 960 Ser PheGln Asn Asn Val Gln Ser Gln Asp Pro Gln Val Ser Thr Lys 965 970 975 ThrGly Glu Pro Leu Met Ser Glu Ser Thr Ser Asn Ile Asp Gln Gly 980 985 990Pro Arg Asp His Ser Val Gln Leu Pro Lys Pro Val His Lys Pro Asn 995 10001005 Arg Trp Cys Phe Tyr Ser Ser Cys Glu Gln Leu Asp Gln Leu Ile Glu1010 1015 1020 Ala Leu Asn Ser Arg Gly His Arg Glu Ser Ala Leu Lys GluThr Leu 1025 1030 1035 1040 Leu Gln Glu Lys Ser Arg Ile Cys Ala Gln LeuAla Arg Phe Ser Glu 1045 1050 1055 Glu Lys Phe His Phe Ser Asp Lys ProGln Pro Asp Ser Lys Pro Thr 1060 1065 1070 Tyr Ser Arg Gly Arg Ser SerAsn Ala Tyr Asp Pro Ser Gln Met Cys 1075 1080 1085 Ala Glu Lys Gln LeuGlu Leu Arg Leu Arg Asp Phe Leu Leu Asp Ile 1090 1095 1100 Glu Asp ArgIle Tyr Gln Gly Thr Leu Gly Ala Ile Lys Val Thr Asp 1105 1110 1115 1120Arg His Ile Trp Arg Ser Ala Leu Glu Ser Gly Arg Tyr Glu Leu Leu 11251130 1135 Ser Glu Glu Asn Lys Glu Asn Gly Ile Ile Lys Thr Val Asn GluAsp 1140 1145 1150 Val Glu Glu Met Glu Ile Asp Glu Gln Thr Lys Val IleVal Lys Asp 1155 1160 1165 Arg Leu Leu Gly Ile Lys Thr Glu Thr Pro SerThr Val Ser Thr Asn 1170 1175 1180 Ala Ser Thr Pro Gln Ser Val Ser SerVal Val His Tyr Leu Ala Met 1185 1190 1195 1200 Ala Leu Phe Gln Ile GluGln Gly Ile Glu Arg Arg Phe Leu Lys Ala 1205 1210 1215 Pro Leu Asp AlaSer Asp Ser Gly Arg Ser Tyr Lys Thr Val Leu Asp 1220 1225 1230 Arg TrpArg Glu Ser Leu Leu Ser Ser Ala Ser Leu Ser Gln Val Phe 1235 1240 1245Leu His Leu Ser Thr Leu Asp Arg Ser Val Ile Trp Ser Lys Ser Ile 12501255 1260 Leu Asn Ala Arg Cys Lys Ile Cys Arg Lys Lys Gly Asp Ala GluAsn 1265 1270 1275 1280 Met Val Leu Cys Asp Gly Cys Asp Arg Gly His HisThr Tyr Cys Val 1285 1290 1295 Arg Pro Lys Leu Lys Thr Val Pro Glu GlyAsp Trp Phe Cys Pro Glu 1300 1305 1310 Cys Arg Pro Lys Gln Arg Cys ArgArg Leu Ser Phe Arg Gln Arg Pro 1315 1320 1325 Ser Leu Glu Ser Asp GluAsp Val Glu Asp Ser Met Gly Gly Glu Asp 1330 1335 1340 Asp Glu Val AspGly Asp Glu Glu Glu Gly Gln Ser Glu Glu Glu Glu 1345 1350 1355 1360 TyrGlu Val Glu Gln Asp Glu Asp Asp Ser Gln Glu Glu Glu Glu Val 1365 13701375 Ser Leu Pro Lys Arg Gly Arg Pro Gln Val Arg Leu Pro Val Lys Thr1380 1385 1390 Arg Gly Lys Leu Ser Ser Ser Phe Ser Ser Arg Gly Gln GlnGln Glu 1395 1400 1405 Pro Gly Arg Tyr Pro Ser Arg Ser Gln Gln Ser ThrPro Lys Thr Thr 1410 1415 1420 Val Ser Ser Lys Thr Gly Arg Ser Leu ArgLys Ile Asn Ser Ala Pro 1425 1430 1435 1440 Pro Thr Glu Thr Lys Ser LeuArg Ile Ala Ser Arg Ser Thr Arg His 1445 1450 1455 Ser His Gly Pro LeuGln Ala Asp Val Phe Val Glu Leu Leu Ser Pro 1460 1465 1470 Arg Arg LysArg Arg Gly Arg Lys Ser Ala Asn Asn Thr Pro Glu Asn 1475 1480 1485 SerPro Asn Phe Pro Asn Phe Arg Val Ile Ala Thr Lys Ser Ser Glu 1490 14951500 Gln Ser Arg Ser Val Asn Ile Ala Ser Lys Leu Ser Leu Gln Glu Ser1505 1510 1515 1520 Glu Ser Lys Arg Arg Cys Arg Lys Arg Gln Ser Pro GluPro Ser Pro 1525 1530 1535 Val Thr Leu Gly Arg Arg Ser Ser Gly Arg GlnGly Gly Val His Glu 1540 1545 1550 Leu Ser Ala Phe Glu Gln Leu Val ValGlu Leu Val Arg His Asp Asp 1555 1560 1565 Ser Trp Pro Phe Leu Lys LeuVal Ser Lys Ile Gln Val Pro Asp Tyr 1570 1575 1580 Tyr Asp Ile Ile LysLys Pro Ile Ala Leu Asn Ile Ile Arg Glu Lys 1585 1590 1595 1600 Val AsnLys Cys Glu Tyr Lys Leu Ala Ser Glu Phe Ile Asp Asp Ile 1605 1610 1615Glu Leu Met Phe Ser Asn Cys Phe Glu Tyr Asn Pro Arg Asn Thr Ser 16201625 1630 Glu Ala Lys Ala Gly Thr Arg Leu Gln Ala Phe Phe His Ile GlnAla 1635 1640 1645 Gln Lys Leu Gly Leu His Val Thr Pro Ser Asn Val AspGln Val Ser 1650 1655 1660 Thr Pro Pro Ala Ala Lys Lys Ser Arg Ile 16651670 <210> SEQ ID NO 2 <211> LENGTH: 5934 <212> TYPE: DNA <213>ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: CDS <222>LOCATION: (125)...(5146) <221> NAME/KEY: misc_feature <222> LOCATION:(1)...(5934) <223> OTHER INFORMATION: n = A,T,C or G <400> SEQUENCE: 2gaattccggc ttttcccatc gtgtagtcaa gagtctgtgc cagacttgaa ggctttactt 60tgttagccat gtgtttatga acccccagcg ctttccctag atcttttggc tgataatctc 120aaac atg gag gat gct tct gaa tct tca cga ggg gtt gct cca tta att 169 MetGlu Asp Ala Ser Glu Ser Ser Arg Gly Val Ala Pro Leu Ile 1 5 10 15 aataat gta gtt ctc cca ggc tct ccg ctg tct ctt cct gta tca gtg 217 Asn AsnVal Val Leu Pro Gly Ser Pro Leu Ser Leu Pro Val Ser Val 20 25 30 aca ggctgt aaa agt cat cga gta gcc aat aaa aag gta gaa gcg agg 265 Thr Gly CysLys Ser His Arg Val Ala Asn Lys Lys Val Glu Ala Arg 35 40 45 agt gaa aagctc ctc cca aca gct ctt cct cct tca gag ccg aaa gta 313 Ser Glu Lys LeuLeu Pro Thr Ala Leu Pro Pro Ser Glu Pro Lys Val 50 55 60 gat cag aaa cttccc agg agc tcc gag agg cgg gga agt ggc ggt ggg 361 Asp Gln Lys Leu ProArg Ser Ser Glu Arg Arg Gly Ser Gly Gly Gly 65 70 75 acg caa ttc ccc gcgcgg agt cgg gca gtg gca gcg gga gaa gcg gca 409 Thr Gln Phe Pro Ala ArgSer Arg Ala Val Ala Ala Gly Glu Ala Ala 80 85 90 95 gcc agg ggc gcg gcgggg ccg gag aga ggc agt ccc ctg gga aga cgg 457 Ala Arg Gly Ala Ala GlyPro Glu Arg Gly Ser Pro Leu Gly Arg Arg 100 105 110 gtc tcc cct cgt tgcctt tgt agt gga gaa ggt gga caa gtg gca gtc 505 Val Ser Pro Arg Cys LeuCys Ser Gly Glu Gly Gly Gln Val Ala Val 115 120 125 ggc gtg atc gca gggaag cgg ggc cgg cgc ggg cgc gac ggg tcc agg 553 Gly Val Ile Ala Gly LysArg Gly Arg Arg Gly Arg Asp Gly Ser Arg 130 135 140 cga gcc ccg ggc ggacgg gag atg ccg ctg cta cac cga aag ccg ttt 601 Arg Ala Pro Gly Gly ArgGlu Met Pro Leu Leu His Arg Lys Pro Phe 145 150 155 gtg aga cag aag ccgccc gcg gac ctg cgg ccc gac gag gaa gtt ttc 649 Val Arg Gln Lys Pro ProAla Asp Leu Arg Pro Asp Glu Glu Val Phe 160 165 170 175 tac tgt aaa gtcacc aac gag atc ttc cgc cac tac gat gac ttt ttt 697 Tyr Cys Lys Val ThrAsn Glu Ile Phe Arg His Tyr Asp Asp Phe Phe 180 185 190 gaa cga acc attctg tgc aac agc ctt gtg tgg agt tgt gct gtg acg 745 Glu Arg Thr Ile LeuCys Asn Ser Leu Val Trp Ser Cys Ala Val Thr 195 200 205 ggt aga cct ggactg acg tat cag gaa gca ctt gag tca gaa aaa aaa 793 Gly Arg Pro Gly LeuThr Tyr Gln Glu Ala Leu Glu Ser Glu Lys Lys 210 215 220 gca aga cag aatctt cag agt ttt cca gaa cca cta att att cca gtt 841 Ala Arg Gln Asn LeuGln Ser Phe Pro Glu Pro Leu Ile Ile Pro Val 225 230 235 tta tac ttg accagc ctt acc cat cgt tcg cgc tta cat gaa att tgt 889 Leu Tyr Leu Thr SerLeu Thr His Arg Ser Arg Leu His Glu Ile Cys 240 245 250 255 gat gat atcttt gca tat gtc aag gat cga tat ttt gtc gaa gaa act 937 Asp Asp Ile PheAla Tyr Val Lys Asp Arg Tyr Phe Val Glu Glu Thr 260 265 270 gtg gaa gtcatt agg aac aat ggt gca agg ttg cag tgt acg att ttg 985 Val Glu Val IleArg Asn Asn Gly Ala Arg Leu Gln Cys Thr Ile Leu 275 280 285 gaa gtc ctccct cca tca cat caa aat ggt ttt gct aat gga cat gtt 1033 Glu Val Leu ProPro Ser His Gln Asn Gly Phe Ala Asn Gly His Val 290 295 300 aac agt gtggat gga gaa act att atc atc agt gat agt gat gat tca 1081 Asn Ser Val AspGly Glu Thr Ile Ile Ile Ser Asp Ser Asp Asp Ser 305 310 315 gaa aca caaagc tgt tct ttt caa aat ggg aag aaa aaa gat gca att 1129 Glu Thr Gln SerCys Ser Phe Gln Asn Gly Lys Lys Lys Asp Ala Ile 320 325 330 335 gat ccctta cta ttc aag tat aaa gtg caa ccc act aaa aaa gaa tta 1177 Asp Pro LeuLeu Phe Lys Tyr Lys Val Gln Pro Thr Lys Lys Glu Leu 340 345 350 cat gagtct gct att gtt aaa gca aca caa atc agc cgg aga aaa cac 1225 His Glu SerAla Ile Val Lys Ala Thr Gln Ile Ser Arg Arg Lys His 355 360 365 cta ttttct cgt gat aaa cta aag ctt ttt ctg aag caa cac tgt gaa 1273 Leu Phe SerArg Asp Lys Leu Lys Leu Phe Leu Lys Gln His Cys Glu 370 375 380 cca caagaa gga gtc att aaa ata aag gca tca tct ctt tca acg tat 1321 Pro Gln GluGly Val Ile Lys Ile Lys Ala Ser Ser Leu Ser Thr Tyr 385 390 395 aaa atagca gaa caa gat ttt tct tat ttc ttc cct gat gat cca ccc 1369 Lys Ile AlaGlu Gln Asp Phe Ser Tyr Phe Phe Pro Asp Asp Pro Pro 400 405 410 415 acattt atc ttc agt cct gct aac aga cga aga ggg aga cct ccc aaa 1417 Thr PheIle Phe Ser Pro Ala Asn Arg Arg Arg Gly Arg Pro Pro Lys 420 425 430 cgaata cat att agt caa gag gac aat gtt gct aat aaa cag act ctt 1465 Arg IleHis Ile Ser Gln Glu Asp Asn Val Ala Asn Lys Gln Thr Leu 435 440 445 gcaagt tat agg agc aaa gct act aaa gaa aga gat aaa ctt ttg aaa 1513 Ala SerTyr Arg Ser Lys Ala Thr Lys Glu Arg Asp Lys Leu Leu Lys 450 455 460 caagaa gaa atg aag tca ctg gct ttt gaa aag gct aaa tta aaa aga 1561 Gln GluGlu Met Lys Ser Leu Ala Phe Glu Lys Ala Lys Leu Lys Arg 465 470 475 gaaaaa gca gat gcc cta gaa gcg aag aaa aaa gaa aaa gaa gat aaa 1609 Glu LysAla Asp Ala Leu Glu Ala Lys Lys Lys Glu Lys Glu Asp Lys 480 485 490 495gag aaa aag agg gaa gaa ttg aaa aaa att gtt gaa gaa gag aga cta 1657 GluLys Lys Arg Glu Glu Leu Lys Lys Ile Val Glu Glu Glu Arg Leu 500 505 510aag aaa aaa gaa gaa aaa gag agg ctt aaa gta gaa aga gaa aag gaa 1705 LysLys Lys Glu Glu Lys Glu Arg Leu Lys Val Glu Arg Glu Lys Glu 515 520 525aga gag aag tta cgt gaa gaa aag cga aag tat gtg gaa tac tta aaa 1753 ArgGlu Lys Leu Arg Glu Glu Lys Arg Lys Tyr Val Glu Tyr Leu Lys 530 535 540cag tgg agt aaa cct aga gaa gat atg gaa tgt gat gac ctt aag gaa 1801 GlnTrp Ser Lys Pro Arg Glu Asp Met Glu Cys Asp Asp Leu Lys Glu 545 550 555ctt cca gaa cca aca cca gtg aaa act aga cta cct cct gaa atc ttt 1849 LeuPro Glu Pro Thr Pro Val Lys Thr Arg Leu Pro Pro Glu Ile Phe 560 565 570575 ggt gat gct ctg atg gtt ttg gag ttc ctt aat gca ttt ggg gaa ctt 1897Gly Asp Ala Leu Met Val Leu Glu Phe Leu Asn Ala Phe Gly Glu Leu 580 585590 ttt gat ctt caa gat gag ttt cct gat gga gta acc cta gaa gta tta 1945Phe Asp Leu Gln Asp Glu Phe Pro Asp Gly Val Thr Leu Glu Val Leu 595 600605 gag gaa gct ctt gtt gga aat gac agt gaa ggc cca ctg tgt gaa ttg 1993Glu Glu Ala Leu Val Gly Asn Asp Ser Glu Gly Pro Leu Cys Glu Leu 610 615620 ctt ttt ttc ttc ctg act gca atc ttc cag gca ata gct gaa gaa gaa 2041Leu Phe Phe Phe Leu Thr Ala Ile Phe Gln Ala Ile Ala Glu Glu Glu 625 630635 gag gaa gta gcc aaa gag caa cta act gat gct gac acc aaa ggc tgc 2089Glu Glu Val Ala Lys Glu Gln Leu Thr Asp Ala Asp Thr Lys Gly Cys 640 645650 655 agt ttg aaa agt ttg gat ctt gat agc tgc act ctt tca gaa atc ctc2137 Ser Leu Lys Ser Leu Asp Leu Asp Ser Cys Thr Leu Ser Glu Ile Leu 660665 670 aga ctg cac atc tta gct tca ggt gct gat gta aca tca gca aat gca2185 Arg Leu His Ile Leu Ala Ser Gly Ala Asp Val Thr Ser Ala Asn Ala 675680 685 aag tat aga tat caa aaa cga gga gga ttt gat gct aca gat gat gct2233 Lys Tyr Arg Tyr Gln Lys Arg Gly Gly Phe Asp Ala Thr Asp Asp Ala 690695 700 tgt atg gag ctt cgt ttg agc aat ccc agt cta gtg aag aaa ctg tca2281 Cys Met Glu Leu Arg Leu Ser Asn Pro Ser Leu Val Lys Lys Leu Ser 705710 715 agc acc tca gtg tat gat ttg aca cca gga gaa aaa atg aag ata ctc2329 Ser Thr Ser Val Tyr Asp Leu Thr Pro Gly Glu Lys Met Lys Ile Leu 720725 730 735 cat gct ctc tgt gga aag cta ctg acc cta gtt tca act agg gatttt 2377 His Ala Leu Cys Gly Lys Leu Leu Thr Leu Val Ser Thr Arg Asp Phe740 745 750 att gaa gat tat gtt gat ata tta cga cag gca aag cag gag ttccgg 2425 Ile Glu Asp Tyr Val Asp Ile Leu Arg Gln Ala Lys Gln Glu Phe Arg755 760 765 gaa tta aaa gca gaa caa cat cga aaa gag agg gaa gaa gca gctgcc 2473 Glu Leu Lys Ala Glu Gln His Arg Lys Glu Arg Glu Glu Ala Ala Ala770 775 780 aga att cgt aaa agg aag gaa gaa aaa ctt aag gag caa gaa caaaaa 2521 Arg Ile Arg Lys Arg Lys Glu Glu Lys Leu Lys Glu Gln Glu Gln Lys785 790 795 atg aaa gag aaa caa gaa aaa ctg aaa gaa gat gag caa aga aattca 2569 Met Lys Glu Lys Gln Glu Lys Leu Lys Glu Asp Glu Gln Arg Asn Ser800 805 810 815 acg gca gat ata tct att ggg gag gaa gaa agg gaa gat tttgat act 2617 Thr Ala Asp Ile Ser Ile Gly Glu Glu Glu Arg Glu Asp Phe AspThr 820 825 830 agc att gag agc aaa gac aca gag caa aag gaa tta gat caagat atg 2665 Ser Ile Glu Ser Lys Asp Thr Glu Gln Lys Glu Leu Asp Gln AspMet 835 840 845 ttc act gaa gat gaa gat gac cca gga tca cat aaa aga ggcaga agg 2713 Phe Thr Glu Asp Glu Asp Asp Pro Gly Ser His Lys Arg Gly ArgArg 850 855 860 ggg aaa aga gga caa aat gga ttt aaa gaa ttt aca agg caagaa cag 2761 Gly Lys Arg Gly Gln Asn Gly Phe Lys Glu Phe Thr Arg Gln GluGln 865 870 875 atc aac tgt gta aca aga gag ctt ctt act gct gat gag gaagaa gca 2809 Ile Asn Cys Val Thr Arg Glu Leu Leu Thr Ala Asp Glu Glu GluAla 880 885 890 895 tta aaa cag gaa cac caa cga aaa gag aaa gag ctc ttagaa aaa atc 2857 Leu Lys Gln Glu His Gln Arg Lys Glu Lys Glu Leu Leu GluLys Ile 900 905 910 caa agt gcc ata gcc tgt acc aat atc ttt ccc ttg ggtcgc gac cgc 2905 Gln Ser Ala Ile Ala Cys Thr Asn Ile Phe Pro Leu Gly ArgAsp Arg 915 920 925 atg tat aga cga tac tgg att ttc cct tct att cct ggactc ttt att 2953 Met Tyr Arg Arg Tyr Trp Ile Phe Pro Ser Ile Pro Gly LeuPhe Ile 930 935 940 gaa gag gat tat tct ggt ctt act gaa gac atg ctg ttgcct aga cct 3001 Glu Glu Asp Tyr Ser Gly Leu Thr Glu Asp Met Leu Leu ProArg Pro 945 950 955 tca tca ttt cag aat aat gta cag tct caa gat cct caggta tcc act 3049 Ser Ser Phe Gln Asn Asn Val Gln Ser Gln Asp Pro Gln ValSer Thr 960 965 970 975 aaa act gga gag cct ttg atg tct gaa tct acc tccaac att gac caa 3097 Lys Thr Gly Glu Pro Leu Met Ser Glu Ser Thr Ser AsnIle Asp Gln 980 985 990 ggt cca cgt gac cat tct gtg cag ctg cca aaa ccagtg cat aag cca 3145 Gly Pro Arg Asp His Ser Val Gln Leu Pro Lys Pro ValHis Lys Pro 995 1000 1005 aat cgg tgg tgc ttt tac agt tct tgt gaa cagcta gac cag ctt att 3193 Asn Arg Trp Cys Phe Tyr Ser Ser Cys Glu Gln LeuAsp Gln Leu Ile 1010 1015 1020 gaa gct ctt aat tct aga gga cat aga gaaagt gcc tta aaa gaa act 3241 Glu Ala Leu Asn Ser Arg Gly His Arg Glu SerAla Leu Lys Glu Thr 1025 1030 1035 ttg tta caa gag aaa agc aga ata tgtgca cag cta gcc cgt ttt tct 3289 Leu Leu Gln Glu Lys Ser Arg Ile Cys AlaGln Leu Ala Arg Phe Ser 1040 1045 1050 1055 gaa gag aaa ttt cat ttt tcagac aaa cct cag cct gat agc aaa cca 3337 Glu Glu Lys Phe His Phe Ser AspLys Pro Gln Pro Asp Ser Lys Pro 1060 1065 1070 aca tat agt cgg gga agatct tcc aat gca tat gat cca tct cag atg 3385 Thr Tyr Ser Arg Gly Arg SerSer Asn Ala Tyr Asp Pro Ser Gln Met 1075 1080 1085 tgt gca gaa aag caactt gaa cta agg ctg aga gat ttt ctt tta gat 3433 Cys Ala Glu Lys Gln LeuGlu Leu Arg Leu Arg Asp Phe Leu Leu Asp 1090 1095 1100 att gaa gat agaatc tac caa gga aca tta gga gcc atc aag gtt aca 3481 Ile Glu Asp Arg IleTyr Gln Gly Thr Leu Gly Ala Ile Lys Val Thr 1105 1110 1115 gat cga catatc tgg aga tca gca tta gaa agt gga cgg tat gag ctg 3529 Asp Arg His IleTrp Arg Ser Ala Leu Glu Ser Gly Arg Tyr Glu Leu 1120 1125 1130 1135 ttaagt gag gaa aac aag gaa aat ggg ata att aaa act gtg aat gaa 3577 Leu SerGlu Glu Asn Lys Glu Asn Gly Ile Ile Lys Thr Val Asn Glu 1140 1145 1150gac gta gaa gag atg gaa att gat gaa caa aca aag gtc ata gta aaa 3625 AspVal Glu Glu Met Glu Ile Asp Glu Gln Thr Lys Val Ile Val Lys 1155 11601165 gac aga ctt ttg ggg ata aaa aca gaa act cca agt act gta tca aca3673 Asp Arg Leu Leu Gly Ile Lys Thr Glu Thr Pro Ser Thr Val Ser Thr1170 1175 1180 aat gca agt aca cca caa tca gtg agc agt gtg gtt cat tatctg gca 3721 Asn Ala Ser Thr Pro Gln Ser Val Ser Ser Val Val His Tyr LeuAla 1185 1190 1195 atg gca ctc ttt caa ata gag cag ggc att gag cgg cgtttt ctg aaa 3769 Met Ala Leu Phe Gln Ile Glu Gln Gly Ile Glu Arg Arg PheLeu Lys 1200 1205 1210 1215 gct cca ctt gat gcc agt gac agt ggg cgt tcttat aaa aca gtt ctg 3817 Ala Pro Leu Asp Ala Ser Asp Ser Gly Arg Ser TyrLys Thr Val Leu 1220 1225 1230 gac cgt tgg aga gag tct ctc ctt tct tctgct agt cta tcc caa gtt 3865 Asp Arg Trp Arg Glu Ser Leu Leu Ser Ser AlaSer Leu Ser Gln Val 1235 1240 1245 ttt ctt cac cta tcc acc ttg gat cgtagc gtg ata tgg tct aaa tct 3913 Phe Leu His Leu Ser Thr Leu Asp Arg SerVal Ile Trp Ser Lys Ser 1250 1255 1260 ata ctg aat gcg cgt tgc aag atatgt cga aag aaa ggc gat gct gaa 3961 Ile Leu Asn Ala Arg Cys Lys Ile CysArg Lys Lys Gly Asp Ala Glu 1265 1270 1275 aac atg gtt ctt tgt gat ggctgt gat agg ggt cat cat acc tac tgt 4009 Asn Met Val Leu Cys Asp Gly CysAsp Arg Gly His His Thr Tyr Cys 1280 1285 1290 1295 gtt cga cca aag ctcaag act gtg cct gaa gga gac tgg ttt tgt cca 4057 Val Arg Pro Lys Leu LysThr Val Pro Glu Gly Asp Trp Phe Cys Pro 1300 1305 1310 gaa tgt cga ccaaag caa cgt tgt aga aga ctg tcc ttt aga cag aga 4105 Glu Cys Arg Pro LysGln Arg Cys Arg Arg Leu Ser Phe Arg Gln Arg 1315 1320 1325 cca tcc ttggaa agt gat gaa gat gtg gaa gac agt atg gga ggt gag 4153 Pro Ser Leu GluSer Asp Glu Asp Val Glu Asp Ser Met Gly Gly Glu 1330 1335 1340 gat gatgaa gtt gat ggc gat gaa gaa gaa ggt caa agt gag gag gaa 4201 Asp Asp GluVal Asp Gly Asp Glu Glu Glu Gly Gln Ser Glu Glu Glu 1345 1350 1355 gagtat gag gta gaa caa gat gaa gat gac tct caa gaa gag gaa gaa 4249 Glu TyrGlu Val Glu Gln Asp Glu Asp Asp Ser Gln Glu Glu Glu Glu 1360 1365 13701375 gtc agc cta ccc aaa cga gga aga cca caa gtt aga ttg cca gtt aaa4297 Val Ser Leu Pro Lys Arg Gly Arg Pro Gln Val Arg Leu Pro Val Lys1380 1385 1390 aca aga ggg aaa ctt agc tct tct ttc tca agt cgt ggc caacaa caa 4345 Thr Arg Gly Lys Leu Ser Ser Ser Phe Ser Ser Arg Gly Gln GlnGln 1395 1400 1405 gaa cct gga aga tac cct tcc agg agt cag cag agc acaccc aaa aca 4393 Glu Pro Gly Arg Tyr Pro Ser Arg Ser Gln Gln Ser Thr ProLys Thr 1410 1415 1420 act gtt tct tct aaa act ggt aga agc cta aga aagata aac tct gct 4441 Thr Val Ser Ser Lys Thr Gly Arg Ser Leu Arg Lys IleAsn Ser Ala 1425 1430 1435 cct cct aca gaa aca aaa tct tta aga att gccagt cgt tct act cgc 4489 Pro Pro Thr Glu Thr Lys Ser Leu Arg Ile Ala SerArg Ser Thr Arg 1440 1445 1450 1455 cac agt cat ggc cca ctg caa gca gatgta ttt gtg gaa ttg ctt agt 4537 His Ser His Gly Pro Leu Gln Ala Asp ValPhe Val Glu Leu Leu Ser 1460 1465 1470 cct cgt aga aaa cgc aga ggc aggaaa agt gct aat aat aca cca gaa 4585 Pro Arg Arg Lys Arg Arg Gly Arg LysSer Ala Asn Asn Thr Pro Glu 1475 1480 1485 aat agt ccc aac ttc cct aacttc aga gtc att gcc aca aag tca agt 4633 Asn Ser Pro Asn Phe Pro Asn PheArg Val Ile Ala Thr Lys Ser Ser 1490 1495 1500 gaa cag tca aga tct gtaaat att gct tca aaa ctt tct ctc caa gag 4681 Glu Gln Ser Arg Ser Val AsnIle Ala Ser Lys Leu Ser Leu Gln Glu 1505 1510 1515 agt gaa tcc aaa agaaga tgc aga aaa aga caa tct cca gag cca tcg 4729 Ser Glu Ser Lys Arg ArgCys Arg Lys Arg Gln Ser Pro Glu Pro Ser 1520 1525 1530 1535 cct gtg acactg ggt cga agg agt tct ggc cga cag gga gga gtt cat 4777 Pro Val Thr LeuGly Arg Arg Ser Ser Gly Arg Gln Gly Gly Val His 1540 1545 1550 gaa ttgtct gct ttt gaa caa ctt gtt gta gaa ttg gta cga cat gat 4825 Glu Leu SerAla Phe Glu Gln Leu Val Val Glu Leu Val Arg His Asp 1555 1560 1565 gacagc tgg cct ttt ttg aaa ctt gtt tct aaa atc cag gtc cca gac 4873 Asp SerTrp Pro Phe Leu Lys Leu Val Ser Lys Ile Gln Val Pro Asp 1570 1575 1580tac tat gac atc atc aaa aag ccc att gcc tta aat ata att cgt gaa 4921 TyrTyr Asp Ile Ile Lys Lys Pro Ile Ala Leu Asn Ile Ile Arg Glu 1585 15901595 aaa gtg aat aag tgt gaa tat aaa tta gca tct gag ttt att gat gac4969 Lys Val Asn Lys Cys Glu Tyr Lys Leu Ala Ser Glu Phe Ile Asp Asp1600 1605 1610 1615 att gag tta atg ttt tcg aac tgc ttt gaa tac aac cctcgt aac aca 5017 Ile Glu Leu Met Phe Ser Asn Cys Phe Glu Tyr Asn Pro ArgAsn Thr 1620 1625 1630 agt gaa gca aaa gct gga act agg ctt caa gca tttttt cat att cag 5065 Ser Glu Ala Lys Ala Gly Thr Arg Leu Gln Ala Phe PheHis Ile Gln 1635 1640 1645 gct caa aag ctt gga ctc cac gtc aca ccc agtaat gtg gac caa gtt 5113 Ala Gln Lys Leu Gly Leu His Val Thr Pro Ser AsnVal Asp Gln Val 1650 1655 1660 agc aca cca ccg gct gcg aaa aag tca cgaatc tgactttgtc cttctaaagg 5166 Ser Thr Pro Pro Ala Ala Lys Lys Ser ArgIle 1665 1670 atatatttga agaaaaacaa attgttcatg aaaatggaac attaaatcatgctgtataaa 5226 gcaataacaa acaattgatt gaccacatga aagtgtggcc tgcactatattctcaatttt 5286 aatattaagc actcaggaga atgtaggaaa gatatccttt gctacagttttgttcagtat 5346 ctaataagtt tgatagatgt attggataca gtactggttt acagaggtttttgtacattt 5406 ttganatcat tcatgtgtcc agagatcttg gaaaatattt tttcacccacgatttatttt 5466 gttattgatg atttattttt aaagtggtgg tattaaggga gagttatctacatggatgag 5526 tcttccgcta tagcacagtt tagaaaaggt gtttatgtct taattaattgtttgagtaca 5586 ttctttcaac actacacatg aatgaatcca atcttataac cttgaagtgctgtaccagtg 5646 ctggctgcag gtattaagtc caagtttatt aactagatat ttatttagtattgagagtaa 5706 tttgtgaatt tgttttgtat ttataaaatt tatacctgga aaatgttccttaatgtttta 5766 aaccttttac tgtgttttta ttcctctaac ttccttaatg atcaatcaaaaaaagtaaca 5826 ccctcccttt ttcctgacag ttctttcagc tttacagaac tgtattataagttcgatgta 5886 taattttaac tgttcaaata aaatacattt ttccaataaa aaaaaaaa5934 <210> SEQ ID NO 3 <211> LENGTH: 140 <212> TYPE: DNA <213> ORGANISM:Homo sapiens <400> SEQUENCE: 3 gactaccacg acatcatcaa gaaaccaatggatctgggca cagtcaagcg gaaaatggac 60 aatcgcgagt acaagagcgc gccggaatttgccgccgacg tgcgattaat attcaccaac 120 tgctacaagt acaatccgcc 140 <210> SEQID NO 4 <211> LENGTH: 23 <212> TYPE: DNA <213> ORGANISM: ArtificialSequence <220> FEATURE: <223> OTHER INFORMATION: Synthetically generatedprimer <400> SEQUENCE: 4 agaaaaagac aatctccaga gca 23 <210> SEQ ID NO 5<211> LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence<220> FEATURE: <223> OTHER INFORMATION: Synthetically generated primer<400> SEQUENCE: 5 gctgtcatca tgtcgtacca attc 24 <210> SEQ ID NO 6 <211>LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 6 aacacaagtg aagcaaaagc tgga 24 <210> SEQ ID NO 7 <211>LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 7 gtggtgtgct aacttggtcc acat 24 <210> SEQ ID NO 8 <211>LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 8 cccatcgtga gtcaagagtg tctgt 25 <210> SEQ ID NO 9 <211>LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 9 ctcgcttcta cctttttatt ggct 24 <210> SEQ ID NO 10 <211>LENGTH: 25 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 10 tcatcatctc tgccccctct gtctg 25 <210> SEQ ID NO 11 <211>LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 11 gacgcctgct tcaccacctt cttg 24 <210> SEQ ID NO 12 <211>LENGTH: 24 <212> TYPE: DNA <213> ORGANISM: Artificial Sequence <220>FEATURE: <223> OTHER INFORMATION: Synthetically generated primer <400>SEQUENCE: 12 tcatgtggtc aatcaattgt ttgt 24 <210> SEQ ID NO 13 <211>LENGTH: 1878 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <220> FEATURE:<221> NAME/KEY: VARIANT <222> LOCATION: (1)...(1878) <223> OTHERINFORMATION: Xaa = Any Amino Acid <400> SEQUENCE: 13 Met Glu Met Glu AlaAsn Glu Ala Asn Asp His Phe Asn Phe Thr Gly 1 5 10 15 Leu Pro Pro AlaPro Ala Ala Ser Gly Leu Lys Pro Ser Pro Ser Ser 20 25 30 Gly Glu Gly LeuTyr Thr Asn Gly Ser Pro Met Asn Phe Pro Gln Gln 35 40 45 Gly Lys Ser LeuAsn Gly Asp Val Asn Val Asn Gly Leu Ser Thr Val 50 55 60 Ser His Thr ThrThr Ser Gly Ile Leu Asn Ser Ala Pro His Ser Ser 65 70 75 80 Ser Thr SerHis Leu His His Pro Ser Val Ala Tyr Asp Cys Leu Trp 85 90 95 Asn Tyr SerGln Tyr Pro Ser Ala Asn Pro Gly Ser Asn Leu Lys Asp 100 105 110 Pro ProLeu Leu Ser Gln Phe Ser Gly Gly Gln Tyr Pro Leu Asn Gly 115 120 125 IleLeu Gly Gly Ser Arg Gln Pro Ser Ser Pro Ser His Asn Thr Asn 130 135 140Leu Arg Ala Gly Ser Gln Lys Phe Trp Ala Asn Gly Thr His Ser Pro 145 150155 160 Met Gly Leu Asn Phe Asp Ser Gln Glu Leu Tyr Asp Ser Phe Pro Asp165 170 175 Gln Asn Phe Glu Glu Val Cys Ser Gly Ile His Pro Asp Glu AlaAla 180 185 190 Glu Lys Glu Met Thr Ser Val Val Ala Glu Asn Gly Thr GlyLeu Val 195 200 205 Cys Ser Leu Glu Leu Glu Glu Xaa Gln Pro Glu Leu LysMet Cys Gly 210 215 220 Tyr Asn Gly Ser Val Pro Ser Val Glu Ser Leu HisGln Glu Val Ser 225 230 235 240 Val Leu Val Pro Asp Pro Thr Val Ser CysLeu Asp Asp Pro Ser His 245 250 255 Leu Pro Asp Gln Leu Glu Asp Thr ProIle Leu Ser Glu Asp Ser Leu 260 265 270 Glu Pro Phe Asn Ser Leu Ala ProGlu Pro Val Ser Gly Gly Leu Tyr 275 280 285 Gly Ile Asp Asp Thr Glu LeuMet Gly Ala Glu Asp Lys Leu Pro Leu 290 295 300 Xaa Asp Ser Pro Val IleSer Ala Leu Asp Cys Pro Ser Leu Asn Asn 305 310 315 320 Ala Thr Ala PheSer Leu Leu Ala Asp Asp Ser Gln Thr Ser Thr Ser 325 330 335 Ile Phe AlaSer Pro Thr Ser Pro Pro Val Leu Gly Glu Ser Val Leu 340 345 350 Gln AspAsn Ser Phe Asp Leu Asn Asn Gly Ser Asp Ala Glu Gln Glu 355 360 365 GluMet Glu Thr Gln Ser Ser Asp Phe Pro Pro Ser Leu Thr Gln Pro 370 375 380Ala Pro Asp Gln Ser Ser Thr Ile Gln Leu His Pro Ala Thr Ser Pro 385 390395 400 Ala Val Ser Pro Thr Thr Ser Pro Ala Val Ser Leu Val Val Ser Pro405 410 415 Ala Ala Ser Pro Glu Ile Ser Pro Glu Val Cys Pro Ala Ala SerThr 420 425 430 Val Val Ser Pro Ala Val Phe Ser Val Val Ser Pro Ala SerSer Ala 435 440 445 Val Leu Pro Ala Val Ser Leu Glu Val Pro Leu Thr AlaSer Val Thr 450 455 460 Ser Pro Lys Ala Ser Pro Val Thr Ser Pro Ala AlaAla Phe Pro Thr 465 470 475 480 Ala Ser Pro Ala Asn Lys Asp Val Ser SerPhe Leu Glu Thr Thr Ala 485 490 495 Asp Val Glu Glu Ile Thr Gly Glu GlyLeu Thr Ala Ser Gly Ser Gly 500 505 510 Asp Val Met Arg Arg Arg Ile AlaThr Pro Glu Glu Val Arg Leu Pro 515 520 525 Leu Gln His Gly Trp Arg ArgGlu Val Arg Ile Lys Lys Gly Ser His 530 535 540 Arg Trp Gln Gly Glu ThrTrp Tyr Tyr Gly Pro Cys Gly Lys Arg Met 545 550 555 560 Lys Gln Phe ProGlu Val Ile Lys Tyr Leu Ser Arg Asn Leu Val His 565 570 575 Ser Val ArgArg Glu His Phe Ser Phe Ser Pro Arg Met Pro Val Gly 580 585 590 Asp PhePhe Glu Glu Arg Asp Thr Pro Glu Gly Leu Gln Trp Val Gln 595 600 605 LeuSer Ala Glu Glu Ile Pro Ser Arg Ile Gln Ala Ile Thr Gly Lys 610 615 620Arg Gly Arg Pro Arg Asn Thr Glu Lys Ala Lys Thr Lys Glu Val Pro 625 630635 640 Lys Val Lys Arg Gly Arg Gly Arg Pro Pro Lys Val Lys Ile Thr Glu645 650 655 Leu Leu Asn Lys Thr Asp Asn Arg Pro Leu Lys Lys Leu Glu AlaGln 660 665 670 Glu Thr Leu Asn Glu Glu Asp Lys Ala Lys Ile Ala Lys SerLys Lys 675 680 685 Lys Met Arg Gln Lys Val Gln Arg Gly Glu Cys Leu ThrThr Ile Gln 690 695 700 Gly Gln Ala Arg Asn Lys Arg Lys Gln Glu Thr LysSer Leu Lys His 705 710 715 720 Lys Glu Ala Lys Lys Lys Ser Xaa Ala GluLys Glu Lys Gly Lys Thr 725 730 735 Lys Gln Glu Lys Leu Lys Glu Lys ValLys Arg Glu Lys Lys Glu Lys 740 745 750 Val Lys Met Lys Glu Lys Glu GluVal Thr Lys Ala Lys Pro Ala Cys 755 760 765 Lys Ala Asp Lys Thr Leu AlaThr Gln Arg Arg Leu Glu Glu Arg Gln 770 775 780 Lys Gln Gln Met Ile LeuGlu Glu Met Lys Lys Pro Thr Glu Asp Met 785 790 795 800 Cys Leu Thr AspHis Gln Pro Leu Pro Asp Phe Ser Arg Val Pro Gly 805 810 815 Leu Thr LeuPro Ser Gly Ala Phe Ser Asp Cys Leu Thr Ile Val Glu 820 825 830 Phe LeuHis Ser Phe Gly Lys Val Leu Gly Phe Asp Pro Ala Lys Asp 835 840 845 ValPro Ser Leu Gly Val Leu Gln Glu Gly Leu Leu Cys Gln Gly Asp 850 855 860Ser Leu Gly Glu Val Gln Asp Leu Leu Val Arg Leu Leu Lys Ala Ala 865 870875 880 Leu His Asp Pro Gly Phe Pro Ser Tyr Cys Gln Ser Leu Lys Ile Leu885 890 895 Gly Glu Lys Val Ser Glu Ile Pro Leu Thr Arg Asp Asn Val SerGlu 900 905 910 Ile Leu Arg Cys Phe Leu Met Ala Tyr Gly Val Xaa Pro AlaLeu Cys 915 920 925 Asp Arg Leu Arg Thr Gln Pro Phe Gln Ala Gln Pro ProGln Gln Lys 930 935 940 Ala Ala Val Leu Ala Phe Pro Val His Glu Leu AsnGly Ser Thr Leu 945 950 955 960 Ile Ile Asn Glu Ile Asp Lys Thr Leu GluSer Met Ser Ser Tyr Arg 965 970 975 Lys Asn Lys Trp Ile Val Glu Gly ArgLeu Arg Arg Leu Lys Thr Val 980 985 990 Leu Ala Lys Arg Thr Gly Arg SerGlu Val Glu Met Gly Arg Pro Glu 995 1000 1005 Glu Cys Leu Gly Arg ArgArg Ser Ser Arg Ile Met Glu Glu Thr Ser 1010 1015 1020 Gly Met Glu GluGlu Glu Glu Glu Glu Ser Ile Ala Ala Val Pro Gly 1025 1030 1035 1040 ArgArg Gly Arg Arg Asp Gly Glu Val Asp Ala Thr Ala Ser Ser Ile 1045 10501055 Pro Glu Leu Glu Arg Gln Ile Glu Lys Leu Ser Lys Arg Gln Leu Phe1060 1065 1070 Phe Arg Lys Lys Leu Leu His Ser Ser Gln Met Leu Arg AlaVal Ser 1075 1080 1085 Leu Gly Gln Asp Arg Tyr Arg Arg Arg Tyr Trp ValLeu Pro Tyr Leu 1090 1095 1100 Ala Gly Ile Phe Val Glu Gly Thr Glu GlyAsn Leu Val Pro Glu Glu 1105 1110 1115 1120 Val Ile Lys Lys Glu Thr AspSer Leu Lys Val Ala Ala His Ala Ser 1125 1130 1135 Leu Asn Pro Ala LeuPhe Ser Met Lys Met Glu Leu Ala Gly Ser Asn 1140 1145 1150 Thr Thr AlaSer Ser Pro Ala Arg Ala Arg Ser Arg Pro Leu Lys Thr 1155 1160 1165 LysPro Gly Phe Met Gln Pro Arg His Phe Lys Ser Pro Val Arg Gly 1170 11751180 Gln Asp Ser Glu Gln Pro Gln Ala Gln Leu Gln Pro Glu Ala Gln Leu1185 1190 1195 1200 His Val Pro Ala Gln Pro Gln Pro Gln Leu Gln Leu GlnLeu Gln Ser 1205 1210 1215 His Lys Gly Phe Leu Glu Gln Glu Gly Ser ProLeu Ser Leu Gly Gln 1220 1225 1230 Ser Gln His Asp Leu Ser Gln Ser AlaPhe Leu Ser Trp Leu Ser Gln 1235 1240 1245 Thr Gln Ser His Ser Ser LeuLeu Ser Ser Ser Val Leu Thr Pro Asp 1250 1255 1260 Ser Ser Pro Gly LysLeu Asp Pro Ala Pro Ser Gln Pro Pro Glu Glu 1265 1270 1275 1280 Pro GluPro Asp Glu Ala Glu Ser Ser Pro Asp Leu Gln Ala Phe Trp 1285 1290 1295Phe Asn Ile Ser Ala Gln Met Pro Cys Asn Ala Ala Pro Thr Pro Pro 13001305 1310 Leu Ala Val Ser Glu Asp Gln Pro Thr Pro Ser Pro Gln Gln LeuAla 1315 1320 1325 Ser Ser Lys Pro Met Asn Arg Pro Ser Ala Ala Asn ProCys Ser Pro 1330 1335 1340 Val Gln Phe Ser Ser Thr Pro Leu Ala Gly LeuAla Pro Lys Arg Arg 1345 1350 1355 1360 Ala Gly Asp Pro Gly Glu Met ProGln Ser Pro Thr Gly Leu Gly Gln 1365 1370 1375 Pro Lys Arg Arg Gly ArgPro Pro Ser Lys Phe Phe Lys Gln Met Glu 1380 1385 1390 Gln Arg Tyr LeuThr Gln Leu Thr Ala Gln Pro Val Pro Pro Glu Met 1395 1400 1405 Cys SerGly Trp Trp Trp Ile Pro Asp Pro Glu Met Leu Asp Ala Met 1410 1415 1420Leu Lys Ala Leu His Pro Arg Gly Ile Arg Glu Lys Ala Leu His Lys 14251430 1435 1440 His Leu Asn Lys His Arg Asp Phe Leu Gln Glu Val Cys LeuArg Pro 1445 1450 1455 Ser Ala Asp Pro Ile Phe Glu Pro Arg Gln Leu ProAla Phe Gln Glu 1460 1465 1470 Gly Ile Met Ser Trp Ser Pro Lys Glu LysThr Tyr Glu Thr Asp Leu 1475 1480 1485 Ala Val Leu Gln Trp Val Glu GluLeu Glu Gln Arg Val Ile Met Ser 1490 1495 1500 Asp Leu Gln Ile Arg GlyTrp Thr Cys Pro Ser Pro Asp Ser Thr Arg 1505 1510 1515 1520 Glu Asp LeuAla Tyr Cys Glu His Leu Ser Asp Ser Gln Glu Asp Ile 1525 1530 1535 ThrTrp Arg Gly Pro Gly Arg Glu Gly Leu Ala Pro Gln Arg Lys Thr 1540 15451550 Thr Asn Pro Leu Asp Leu Ala Val Met Arg Leu Ala Ala Leu Glu Gln1555 1560 1565 Asn Val Lys Arg Arg Tyr Leu Arg Glu Pro Leu Trp Pro ThrHis Glu 1570 1575 1580 Val Val Leu Glu Lys Ala Leu Leu Ser Thr Pro AsnGly Ala Pro Glu 1585 1590 1595 1600 Gly Thr Thr Thr Glu Ile Ser Tyr GluIle Thr Pro Arg Ile Arg Ile 1605 1610 1615 Trp Arg Gln Thr Leu Gln ArgCys Arg Ser Ala Ala His Val Cys Leu 1620 1625 1630 Cys Leu Gly His LeuGlu Arg Ser Ile Ala Trp Glu Lys Ser Val Asn 1635 1640 1645 Lys Val ThrCys Leu Val Cys Arg Lys Gly Asp Asn Asp Glu Phe Leu 1650 1655 1660 LeuLeu Cys Asp Gly Cys Asp Arg Gly Cys His Ile Tyr Cys His Arg 1665 16701675 1680 Pro Lys Met Glu Ala Val Pro Glu Gly Asp Trp Phe Cys Thr ValCys 1685 1690 1695 Leu Ala Gln Gln Val Glu Gly Glu Phe Thr Gln Lys ProGly Phe Pro 1700 1705 1710 Lys Arg Gly Gln Lys Arg Lys Ser Gly Tyr SerLeu Asn Phe Ser Glu 1715 1720 1725 Gly Asp Gly Arg Arg Arg Arg Val LeuLeu Lys Gly Arg Glu Ser Pro 1730 1735 1740 Ala Ala Gly Pro Arg Tyr SerGlu Glu Arg Leu Ser Pro Ser Lys Arg 1745 1750 1755 1760 Arg Arg Leu SerMet Arg Asn His His Ser Asp Leu Thr Phe Cys Glu 1765 1770 1775 Ile IleLeu Met Glu Met Glu Ser His Asp Ala Ala Trp Pro Phe Xaa 1780 1785 1790Glu Pro Val Asn Pro Arg Leu Val Ser Gly Tyr Arg Arg Ile Ile Lys 17951800 1805 Asn Pro Met Asp Phe Ser Thr Met Arg Glu Arg Leu Leu Arg GlyGly 1810 1815 1820 Tyr Thr Ser Ser Glu Glu Phe Ala Ala Asp Ala Leu LeuVal Phe Asp 1825 1830 1835 1840 Asn Cys Gln Thr Phe Asn Glu Asp Asp SerGlu Val Gly Lys Ala Gly 1845 1850 1855 His Ile Met Arg Arg Phe Phe GluSer Arg Trp Glu Glu Phe Tyr Gln 1860 1865 1870 Gly Lys Gln Ala Asn Leu1875 <210> SEQ ID NO 14 <211> LENGTH: 9408 <212> TYPE: DNA <213>ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY: CDS <222>LOCATION: (740)...(6373) <221> NAME/KEY: misc_feature <222> LOCATION:(1)...(9408) <223> OTHER INFORMATION: n = A,T,C or G <400> SEQUENCE: 14gttccactca cagatttctt cttattgccc aggcttgagt gcaatgacct gttctcagct 60tactcaacct ctgcctcttg ggtttcagtg atttttctgc ctcggcctcc tgagtagcta 120agggaggagt cttgagatta tcatccacgg agggtggaag aggagagggt ggaaggggaa 180taagaagaca ttcgggaggt gtcttgaggc tcagggagtt atcagttata gaatgttgtt 240gagttggagg aagtggctgg cggcccatcc tgttttttaa agtttcacct gtgaagtagg 300gccagtaggg caatcctgaa gaatgacgat gctccgctgc cgccattctg acctgtaagg 360ccgaaagaaa gggaatgttt tcacacatat tcatttgatg gacaaaatta ccgccaccaa 420cacggtctgc accttctgtt gctggtgata aatttttgca cctttccatc ctccaggttt 480caaaatagca gtatcagtgt cataatatca cccttccact gagtactgcc gacagctggg 540gagtaaaaaa aagtcattgg gacatcaccc tacagcagtt caggctgtgt ggttcgcaag 600aagcatacac tggctttttg attcttgcta gttcccagct cacagtttgg gaggatccaa 660caccaacctt tacgtgaagt ggaggcccaa ggacagtgag gagctgggtg gtcccagcct 720ggagctgtgc cagcctgac atg gaa atg gag gca aac gag gca aac gac cat 772 MetGlu Met Glu Ala Asn Glu Ala Asn Asp His 1 5 10 ttt aac ttt act ggc cttccc cct gca cct gct gcc tca gga ctg aaa 820 Phe Asn Phe Thr Gly Leu ProPro Ala Pro Ala Ala Ser Gly Leu Lys 15 20 25 ccc tct cct tcc tca ggg gagggc ctc tac act aac ggg tct ccc atg 868 Pro Ser Pro Ser Ser Gly Glu GlyLeu Tyr Thr Asn Gly Ser Pro Met 30 35 40 aac ttc ccc cag caa ggg aaa agtttg aat ggg gat gtg aat gtt aat 916 Asn Phe Pro Gln Gln Gly Lys Ser LeuAsn Gly Asp Val Asn Val Asn 45 50 55 ggc tta tct act gta tct cac act actact tca ggg att ttg aac tct 964 Gly Leu Ser Thr Val Ser His Thr Thr ThrSer Gly Ile Leu Asn Ser 60 65 70 75 gct ccc cac tcc tcc agc acc tca cacctc cat cac ccc agc gtg gcc 1012 Ala Pro His Ser Ser Ser Thr Ser His LeuHis His Pro Ser Val Ala 80 85 90 tac gac tgt ctc tgg aac tac tca cag taccca tct gcc aat cct ggc 1060 Tyr Asp Cys Leu Trp Asn Tyr Ser Gln Tyr ProSer Ala Asn Pro Gly 95 100 105 agc aac ctc aag gac cca ccc ctt ctc tcccag ttc tcg ggg gga caa 1108 Ser Asn Leu Lys Asp Pro Pro Leu Leu Ser GlnPhe Ser Gly Gly Gln 110 115 120 tac cca ctc aac ggc atc ctt ggg ggc agccgg caa cct tca tcc cca 1156 Tyr Pro Leu Asn Gly Ile Leu Gly Gly Ser ArgGln Pro Ser Ser Pro 125 130 135 agt cat aac act aac ctt cgg gct ggg agccaa aag ttc tgg gcc aac 1204 Ser His Asn Thr Asn Leu Arg Ala Gly Ser GlnLys Phe Trp Ala Asn 140 145 150 155 ggt acc cat agt ccc atg ggg ctt aacttt gat tca caa gaa ctg tat 1252 Gly Thr His Ser Pro Met Gly Leu Asn PheAsp Ser Gln Glu Leu Tyr 160 165 170 gat tcc ttt cct gac cag aat ttt gaggag gta tgc agt ggt atc cat 1300 Asp Ser Phe Pro Asp Gln Asn Phe Glu GluVal Cys Ser Gly Ile His 175 180 185 cct gat gag gca gca gaa aaa gag atgact tcc gtt gtg gca gaa aat 1348 Pro Asp Glu Ala Ala Glu Lys Glu Met ThrSer Val Val Ala Glu Asn 190 195 200 ggc act ggc ttg gta tgc agc ttg gagctg gaa gaa naa cag cca gaa 1396 Gly Thr Gly Leu Val Cys Ser Leu Glu LeuGlu Glu Xaa Gln Pro Glu 205 210 215 ctg aag atg tgt ggc tac aat ggc tctgtc cct tct gtg gaa tcg tta 1444 Leu Lys Met Cys Gly Tyr Asn Gly Ser ValPro Ser Val Glu Ser Leu 220 225 230 235 cac caa gag gtc tca gtc ctg gtccct gac ccc aca gtg agc tgt tta 1492 His Gln Glu Val Ser Val Leu Val ProAsp Pro Thr Val Ser Cys Leu 240 245 250 gat gat cct tca cat ctt cct gatcaa ctg gaa gac act cca atc ctc 1540 Asp Asp Pro Ser His Leu Pro Asp GlnLeu Glu Asp Thr Pro Ile Leu 255 260 265 agt gaa gac tct ctg gag ccc ttcaac tct ctg gca cca gag cca gtg 1588 Ser Glu Asp Ser Leu Glu Pro Phe AsnSer Leu Ala Pro Glu Pro Val 270 275 280 agt gga gga cta tat ggt att gatgac acg gag ctg atg ggt gca gaa 1636 Ser Gly Gly Leu Tyr Gly Ile Asp AspThr Glu Leu Met Gly Ala Glu 285 290 295 gac aag ctg cct ctt gan gac agccct gtg att tct gcc ctt gat tgc 1684 Asp Lys Leu Pro Leu Xaa Asp Ser ProVal Ile Ser Ala Leu Asp Cys 300 305 310 315 cct tcc ctc aat aat gct actgcc ttc agt ctc ctg gca gat gat agt 1732 Pro Ser Leu Asn Asn Ala Thr AlaPhe Ser Leu Leu Ala Asp Asp Ser 320 325 330 caa aca tca act tct atc tttgcc agt ccc act tct cca cct gtc cta 1780 Gln Thr Ser Thr Ser Ile Phe AlaSer Pro Thr Ser Pro Pro Val Leu 335 340 345 ggg gag tct gtc ctg caa gataac agc ttt gac ctg aat aat ggt agt 1828 Gly Glu Ser Val Leu Gln Asp AsnSer Phe Asp Leu Asn Asn Gly Ser 350 355 360 gac gct gaa cag gaa gaa atggaa act caa tct tca gac ttc cca cca 1876 Asp Ala Glu Gln Glu Glu Met GluThr Gln Ser Ser Asp Phe Pro Pro 365 370 375 tcc ctg acc cag cca gct cctgat cag tca tcc act att cag cta cat 1924 Ser Leu Thr Gln Pro Ala Pro AspGln Ser Ser Thr Ile Gln Leu His 380 385 390 395 cca gca acc tca cca gcagtc tcg cca aca acc tcc cca gca gtc tcc 1972 Pro Ala Thr Ser Pro Ala ValSer Pro Thr Thr Ser Pro Ala Val Ser 400 405 410 cta gtg gtt tct cca gcagcc tcc cca gaa atc tct cca gaa gtt tgt 2020 Leu Val Val Ser Pro Ala AlaSer Pro Glu Ile Ser Pro Glu Val Cys 415 420 425 ccc gca gct tct aca gttgtc tct cca gca gtc ttc tca gtg gtc tct 2068 Pro Ala Ala Ser Thr Val ValSer Pro Ala Val Phe Ser Val Val Ser 430 435 440 cca gct tct tca gca gtcctc cca gca gtc tcc tta gaa gtc ccg ttg 2116 Pro Ala Ser Ser Ala Val LeuPro Ala Val Ser Leu Glu Val Pro Leu 445 450 455 acg gct tca gtg aca tcccca aaa gcc tct ccc gta act tcc cca gca 2164 Thr Ala Ser Val Thr Ser ProLys Ala Ser Pro Val Thr Ser Pro Ala 460 465 470 475 gct gcc ttt cca acagcc tcc cca gca aat aag gat gtc agc agc ttt 2212 Ala Ala Phe Pro Thr AlaSer Pro Ala Asn Lys Asp Val Ser Ser Phe 480 485 490 cta gaa acc act gctgac gtg gaa gag atc act gga gaa gga ctc act 2260 Leu Glu Thr Thr Ala AspVal Glu Glu Ile Thr Gly Glu Gly Leu Thr 495 500 505 gct tct ggt agt ggtgat gtc atg agg aga cgt att gct acc cca gaa 2308 Ala Ser Gly Ser Gly AspVal Met Arg Arg Arg Ile Ala Thr Pro Glu 510 515 520 gaa gtt cgt ctt cccctc caa cat ggg tgg cgg aga gag gtg cgc atc 2356 Glu Val Arg Leu Pro LeuGln His Gly Trp Arg Arg Glu Val Arg Ile 525 530 535 aag aag ggc agc caccga tgg cag ggg gag acn tgg tat tat ggc ccc 2404 Lys Lys Gly Ser His ArgTrp Gln Gly Glu Thr Trp Tyr Tyr Gly Pro 540 545 550 555 tgt ggg aag aggatg aag caa ttt cca gaa gtg atc aag tac ctg agc 2452 Cys Gly Lys Arg MetLys Gln Phe Pro Glu Val Ile Lys Tyr Leu Ser 560 565 570 cgc aac ctg gtacac agt gtc cgc cga gag cac ttc agc ttc agt ccc 2500 Arg Asn Leu Val HisSer Val Arg Arg Glu His Phe Ser Phe Ser Pro 575 580 585 cgt atg cct gttgga gat ttc ttt gaa gaa aga gac acg cca gag ggc 2548 Arg Met Pro Val GlyAsp Phe Phe Glu Glu Arg Asp Thr Pro Glu Gly 590 595 600 ttg cag tgg gtgcag ctc tca gca gag gag atc ccg tcg agg att cag 2596 Leu Gln Trp Val GlnLeu Ser Ala Glu Glu Ile Pro Ser Arg Ile Gln 605 610 615 gca att act ggcaaa cgg ggt cga cct cga aac act gag aag gct aag 2644 Ala Ile Thr Gly LysArg Gly Arg Pro Arg Asn Thr Glu Lys Ala Lys 620 625 630 635 act aag gaagtc ccc aag gtg aaa cgg ggt cga ggt cgg cca cct aag 2692 Thr Lys Glu ValPro Lys Val Lys Arg Gly Arg Gly Arg Pro Pro Lys 640 645 650 gtc aaa atcact gag cta ttg aac aag aca gac aac cgc ccc cta aag 2740 Val Lys Ile ThrGlu Leu Leu Asn Lys Thr Asp Asn Arg Pro Leu Lys 655 660 665 aaa ctg gaggcc caa gaa aca ttg aat gag gag gat aaa gca aag att 2788 Lys Leu Glu AlaGln Glu Thr Leu Asn Glu Glu Asp Lys Ala Lys Ile 670 675 680 gct aaa agcaag aag aag atg agg cag aag gtt caa cgg gga gag tgt 2836 Ala Lys Ser LysLys Lys Met Arg Gln Lys Val Gln Arg Gly Glu Cys 685 690 695 ctn act actatc caa ggg cag gcc aga aat aag cgg aaa caa gag acc 2884 Leu Thr Thr IleGln Gly Gln Ala Arg Asn Lys Arg Lys Gln Glu Thr 700 705 710 715 aag agctta aag cac aag gaa gct aag aag aaa tcc nag gct gag aaa 2932 Lys Ser LeuLys His Lys Glu Ala Lys Lys Lys Ser Xaa Ala Glu Lys 720 725 730 gaa aaagga aag aca aag cag gaa aaa ctg aag gaa aaa gtc aag agg 2980 Glu Lys GlyLys Thr Lys Gln Glu Lys Leu Lys Glu Lys Val Lys Arg 735 740 745 gaa aagaag gag aag gtt aaa atg aag gaa aag gag gag gtg acc aaa 3028 Glu Lys LysGlu Lys Val Lys Met Lys Glu Lys Glu Glu Val Thr Lys 750 755 760 gcc aagcca gcc tgt aaa gca gat aag acc ctg gcc aca cag agg cgc 3076 Ala Lys ProAla Cys Lys Ala Asp Lys Thr Leu Ala Thr Gln Arg Arg 765 770 775 ttg gaggaa cgg cag aag cag cag atg atc ttg gag gaa atg aag aag 3124 Leu Glu GluArg Gln Lys Gln Gln Met Ile Leu Glu Glu Met Lys Lys 780 785 790 795 ccgaca gag gat atg tgt ctg act gac cac cag ccc ctg cct gac ttc 3172 Pro ThrGlu Asp Met Cys Leu Thr Asp His Gln Pro Leu Pro Asp Phe 800 805 810 tcacga gtc cct ggt ctg aca ttg ccc agt gga gcc ttc tca gac tgc 3220 Ser ArgVal Pro Gly Leu Thr Leu Pro Ser Gly Ala Phe Ser Asp Cys 815 820 825 ttgacc att gtg gag ttc ctg cat agc ttt ggc aag gtg ctg ggc ttt 3268 Leu ThrIle Val Glu Phe Leu His Ser Phe Gly Lys Val Leu Gly Phe 830 835 840 gatcct gcc aaa gat gtg cct agc ctg ggg gtc ctg cag gag gga ctc 3316 Asp ProAla Lys Asp Val Pro Ser Leu Gly Val Leu Gln Glu Gly Leu 845 850 855 ctgtgt caa ggt gac agc ttg ggt gag gtg caa gac ctg ctg gtc agg 3364 Leu CysGln Gly Asp Ser Leu Gly Glu Val Gln Asp Leu Leu Val Arg 860 865 870 875ctg ctg aag gct gca ctc cat gat cct ggc ttt ccc tcc tac tgt cag 3412 LeuLeu Lys Ala Ala Leu His Asp Pro Gly Phe Pro Ser Tyr Cys Gln 880 885 890tcc cta aag atc ttg ggg gag aag gtg tct gaa atc cca ctg aca aga 3460 SerLeu Lys Ile Leu Gly Glu Lys Val Ser Glu Ile Pro Leu Thr Arg 895 900 905gac aat gtg tca gag atc ctg cgc tgc ttc ctt atg gca tat gga gta 3508 AspAsn Val Ser Glu Ile Leu Arg Cys Phe Leu Met Ala Tyr Gly Val 910 915 920nag cca gcc ctc tgt gac cgc ctg cgc acc cag cct ttt cag gcc cag 3556 XaaPro Ala Leu Cys Asp Arg Leu Arg Thr Gln Pro Phe Gln Ala Gln 925 930 935cca ccc cag cag aag gct gct gtc ctg gcc ttc cct gtg cat gag ctc 3604 ProPro Gln Gln Lys Ala Ala Val Leu Ala Phe Pro Val His Glu Leu 940 945 950955 aat ggc tcc acc ctc atc atc aat gag att gac aag act ctg gag agt 3652Asn Gly Ser Thr Leu Ile Ile Asn Glu Ile Asp Lys Thr Leu Glu Ser 960 965970 atg tcc agc tac agg aaa aac aag tgg att gtt gaa ggc cgg ctc cgg 3700Met Ser Ser Tyr Arg Lys Asn Lys Trp Ile Val Glu Gly Arg Leu Arg 975 980985 agg ctg aaa act gtt ctg gcc aag cga act ggg cgg tct gaa gta gag 3748Arg Leu Lys Thr Val Leu Ala Lys Arg Thr Gly Arg Ser Glu Val Glu 990 9951000 atg gga agg cca gag gaa tgc ctg gga cgg agg cgc agt tct cgg atc3796 Met Gly Arg Pro Glu Glu Cys Leu Gly Arg Arg Arg Ser Ser Arg Ile1005 1010 1015 atg gag gag acc agt ggc atg gaa gaa gag gaa gaa gag gagtct ata 3844 Met Glu Glu Thr Ser Gly Met Glu Glu Glu Glu Glu Glu Glu SerIle 1020 1025 1030 1035 gca gct gtc cct ggc cgc agg ggt cga aga gat ggagag gtt gat gcc 3892 Ala Ala Val Pro Gly Arg Arg Gly Arg Arg Asp Gly GluVal Asp Ala 1040 1045 1050 aca gca tct agc atc cca gag cta gag cgc cagata gaa aaa ctc agc 3940 Thr Ala Ser Ser Ile Pro Glu Leu Glu Arg Gln IleGlu Lys Leu Ser 1055 1060 1065 aag cgt cag ctt ttc ttt cgc aaa aag ctgctt cac tca tcc cag atg 3988 Lys Arg Gln Leu Phe Phe Arg Lys Lys Leu LeuHis Ser Ser Gln Met 1070 1075 1080 ctt cgg gcg gtc tcc ctg ggt cag gaccgc tac aga cgt cgc tac tgg 4036 Leu Arg Ala Val Ser Leu Gly Gln Asp ArgTyr Arg Arg Arg Tyr Trp 1085 1090 1095 gta ttg ccg tat ttg gct ggt atcttt gta gaa gga aca gag ggg aac 4084 Val Leu Pro Tyr Leu Ala Gly Ile PheVal Glu Gly Thr Glu Gly Asn 1100 1105 1110 1115 tta gtt cct gag gag gtgata aag aag gaa act gac tcc tta aaa gtg 4132 Leu Val Pro Glu Glu Val IleLys Lys Glu Thr Asp Ser Leu Lys Val 1120 1125 1130 gca gcc cat gcg tcactc aac cct gcc ctc ttc tct atg aag atg gag 4180 Ala Ala His Ala Ser LeuAsn Pro Ala Leu Phe Ser Met Lys Met Glu 1135 1140 1145 tta gct ggc tccaac acc act gcc agt tct cct gcc cgg gcc cga agc 4228 Leu Ala Gly Ser AsnThr Thr Ala Ser Ser Pro Ala Arg Ala Arg Ser 1150 1155 1160 cga cct ctaaaa act aag ccc ggg ttt atg caa cct agg cat ttt aag 4276 Arg Pro Leu LysThr Lys Pro Gly Phe Met Gln Pro Arg His Phe Lys 1165 1170 1175 tcc cctgtc agg ggt cag gat tca gaa cag ccc cag gcc cag ctt cag 4324 Ser Pro ValArg Gly Gln Asp Ser Glu Gln Pro Gln Ala Gln Leu Gln 1180 1185 1190 1195cct gag gct cag ctt cat gtt cct gcc cag ccc cag cct cag ctt cag 4372 ProGlu Ala Gln Leu His Val Pro Ala Gln Pro Gln Pro Gln Leu Gln 1200 12051210 ctt cag ctt cag tcc cat aag ggg ttc ctg gag caa gaa ggc tcc cct4420 Leu Gln Leu Gln Ser His Lys Gly Phe Leu Glu Gln Glu Gly Ser Pro1215 1220 1225 ttg tca ctg ggt cag agc cag cat gac ctc agc cag tca gccttc ctg 4468 Leu Ser Leu Gly Gln Ser Gln His Asp Leu Ser Gln Ser Ala PheLeu 1230 1235 1240 tct tgg ctg agc cag act cag agc cat agc tcc ctg ttgagc agc tca 4516 Ser Trp Leu Ser Gln Thr Gln Ser His Ser Ser Leu Leu SerSer Ser 1245 1250 1255 gtc ctc aca cct gat agc agt ccg gga aaa cta gaccca gct cca tca 4564 Val Leu Thr Pro Asp Ser Ser Pro Gly Lys Leu Asp ProAla Pro Ser 1260 1265 1270 1275 caa ccc ccg gag gag cca gag cct gat gaggca gaa tcc agc cct gat 4612 Gln Pro Pro Glu Glu Pro Glu Pro Asp Glu AlaGlu Ser Ser Pro Asp 1280 1285 1290 ctt caa gca ttc tgg ttt aac atc tcagcc cag atg ccc tgc aat gct 4660 Leu Gln Ala Phe Trp Phe Asn Ile Ser AlaGln Met Pro Cys Asn Ala 1295 1300 1305 gcc ccc aca ccg ccc ctt gca gtttct gag gac caa ccc act ccc tcc 4708 Ala Pro Thr Pro Pro Leu Ala Val SerGlu Asp Gln Pro Thr Pro Ser 1310 1315 1320 cct cag cag ctt gcc tcc tccaag cca atg aat aga cct agt gct gcc 4756 Pro Gln Gln Leu Ala Ser Ser LysPro Met Asn Arg Pro Ser Ala Ala 1325 1330 1335 aac cct tgt tct cca gtgcag ttc tct tcc acg ccc ttg gct ggg ttg 4804 Asn Pro Cys Ser Pro Val GlnPhe Ser Ser Thr Pro Leu Ala Gly Leu 1340 1345 1350 1355 gcc cct aag aggcga gca gga gac cct gga gaa atg cca cag agt ccc 4852 Ala Pro Lys Arg ArgAla Gly Asp Pro Gly Glu Met Pro Gln Ser Pro 1360 1365 1370 aca ggg ctggga cag ccc aaa cgg aga ggg aga cct ccc agt aag ttc 4900 Thr Gly Leu GlyGln Pro Lys Arg Arg Gly Arg Pro Pro Ser Lys Phe 1375 1380 1385 ttc aaacag atg gaa cag cgt tac cta acc cag ctg aca gcc cag cct 4948 Phe Lys GlnMet Glu Gln Arg Tyr Leu Thr Gln Leu Thr Ala Gln Pro 1390 1395 1400 gtccca cct gag atg tgc tca ggc tgg tgg tgg ata cca gat cct gag 4996 Val ProPro Glu Met Cys Ser Gly Trp Trp Trp Ile Pro Asp Pro Glu 1405 1410 1415atg ttg gat gcc atg ctc aag gcc cta cac ccc cga ggt atc cgg gag 5044 MetLeu Asp Ala Met Leu Lys Ala Leu His Pro Arg Gly Ile Arg Glu 1420 14251430 1435 aag gca ctt cac aaa cac ctt aac aag cac agg gac ttc ttg caggaa 5092 Lys Ala Leu His Lys His Leu Asn Lys His Arg Asp Phe Leu Gln Glu1440 1445 1450 gtc tgc ctg cgg ccc tca gct gac ccc atc ttt gag ccc aggcaa cta 5140 Val Cys Leu Arg Pro Ser Ala Asp Pro Ile Phe Glu Pro Arg GlnLeu 1455 1460 1465 cct gcc ttt caa gaa ggg att atg agc tgg tcc ccc aaagag aag aca 5188 Pro Ala Phe Gln Glu Gly Ile Met Ser Trp Ser Pro Lys GluLys Thr 1470 1475 1480 tac gag aca gac cta gca gtg ctt caa tgg gta gaggag ctg gag cag 5236 Tyr Glu Thr Asp Leu Ala Val Leu Gln Trp Val Glu GluLeu Glu Gln 1485 1490 1495 cgg gtt atc atg tct gat ctg cag att cgg ggctgg aca tgt cct agc 5284 Arg Val Ile Met Ser Asp Leu Gln Ile Arg Gly TrpThr Cys Pro Ser 1500 1505 1510 1515 cca gac tct acc cgt gaa gac ttg gcctac tgt gag cac ctc tcc gac 5332 Pro Asp Ser Thr Arg Glu Asp Leu Ala TyrCys Glu His Leu Ser Asp 1520 1525 1530 tcc cag gag gat atc acc tgg cgaggt ccg ggc agg gag gga ctg gca 5380 Ser Gln Glu Asp Ile Thr Trp Arg GlyPro Gly Arg Glu Gly Leu Ala 1535 1540 1545 cct caa cgt aaa act acc aaccct ttg gac ctg gct gtg atg cgg ctg 5428 Pro Gln Arg Lys Thr Thr Asn ProLeu Asp Leu Ala Val Met Arg Leu 1550 1555 1560 gct gcc ctg gaa caa aatgta aaa cgg cgg tac ctg cgg gag ccc ctc 5476 Ala Ala Leu Glu Gln Asn ValLys Arg Arg Tyr Leu Arg Glu Pro Leu 1565 1570 1575 tgg cca act cat gaggtt gtg ctg gag aaa gcc ctg ctt agc aca cct 5524 Trp Pro Thr His Glu ValVal Leu Glu Lys Ala Leu Leu Ser Thr Pro 1580 1585 1590 1595 aat ggt gcccct gag ggc acc act aca gag ata tca tat gag atc acc 5572 Asn Gly Ala ProGlu Gly Thr Thr Thr Glu Ile Ser Tyr Glu Ile Thr 1600 1605 1610 cct cgcatt cgt atc tgg cgc cag acc ctc cag cgg tgc cgg agc gca 5620 Pro Arg IleArg Ile Trp Arg Gln Thr Leu Gln Arg Cys Arg Ser Ala 1615 1620 1625 gcccat gtg tgc ttg tgc ctg ggc cat ctg gag agg tcc att gcc tgg 5668 Ala HisVal Cys Leu Cys Leu Gly His Leu Glu Arg Ser Ile Ala Trp 1630 1635 1640gag aag tct gtc aac aaa gtg aca tgt cta gtc tgc cgg aag ggt gac 5716 GluLys Ser Val Asn Lys Val Thr Cys Leu Val Cys Arg Lys Gly Asp 1645 16501655 aat gat gag ttt ctt ctg ctt tgt gat ggg tgt gac cgt ggc tgc cac5764 Asn Asp Glu Phe Leu Leu Leu Cys Asp Gly Cys Asp Arg Gly Cys His1660 1665 1670 1675 att tac tgc cat cgt ccc aag atg gag gct gtc cca gaagga gat tgg 5812 Ile Tyr Cys His Arg Pro Lys Met Glu Ala Val Pro Glu GlyAsp Trp 1680 1685 1690 ttc tgt act gtc tgt ttg gct cag cag gtg gag ggagaa ttc act cag 5860 Phe Cys Thr Val Cys Leu Ala Gln Gln Val Glu Gly GluPhe Thr Gln 1695 1700 1705 aag cct ggt ttc cca aag cgt ggc cag aag cggaaa agt ggt tat tcg 5908 Lys Pro Gly Phe Pro Lys Arg Gly Gln Lys Arg LysSer Gly Tyr Ser 1710 1715 1720 ctg aac ttc tca gaa ggt gat ggc cgc cgacgc cgg gta ctg ttg aag 5956 Leu Asn Phe Ser Glu Gly Asp Gly Arg Arg ArgArg Val Leu Leu Lys 1725 1730 1735 ggc cga gaa agc cca gca gca ggg cctcgg tac tcg gaa gaa agg ctc 6004 Gly Arg Glu Ser Pro Ala Ala Gly Pro ArgTyr Ser Glu Glu Arg Leu 1740 1745 1750 1755 tcc ccc tcc aag cgg cgg cgactc tct atg cgg aac cac cac agt gat 6052 Ser Pro Ser Lys Arg Arg Arg LeuSer Met Arg Asn His His Ser Asp 1760 1765 1770 ctc aca ttt tgc gag attatc ctg atg gag atg gaa tcc cat gat gca 6100 Leu Thr Phe Cys Glu Ile IleLeu Met Glu Met Glu Ser His Asp Ala 1775 1780 1785 gcc tgg cct ttc ntagag cct gtg aac cca cgt ttg gtg agt ggg tac 6148 Ala Trp Pro Phe Xaa GluPro Val Asn Pro Arg Leu Val Ser Gly Tyr 1790 1795 1800 cgg cgc atc atcaaa aat cct atg gat ttt tcc acc atg cgg gag cgg 6196 Arg Arg Ile Ile LysAsn Pro Met Asp Phe Ser Thr Met Arg Glu Arg 1805 1810 1815 ctg ctc agggga ggg tac acc agc tca gag gag ttt gcg gct gat gcc 6244 Leu Leu Arg GlyGly Tyr Thr Ser Ser Glu Glu Phe Ala Ala Asp Ala 1820 1825 1830 1835 ctcctg gta ttt gac aac tgc cag act ttc aac gag gat gac tct gaa 6292 Leu LeuVal Phe Asp Asn Cys Gln Thr Phe Asn Glu Asp Asp Ser Glu 1840 1845 1850gta ggc aag gct ggg cac atc atg cgc cgc ttc ttc gag agc cgc tgg 6340 ValGly Lys Ala Gly His Ile Met Arg Arg Phe Phe Glu Ser Arg Trp 1855 18601865 gag gag ttt tat cag gga aaa cag gcc aat ctg tgaggcaagg gaggtgggga6393 Glu Glu Phe Tyr Gln Gly Lys Gln Ala Asn Leu 1870 1875 gtcaccttgtggcatctccc cccaccttcc aaacaaaaac ctgccatttt cacctgctga 6453 tgctgccctgggtccagact caagtcagat acaaccctga tttttgacct tgcccttggc 6513 agtgccccacatcctcttat tcctacatcc ctttctccct tccctcctct tgctcctcaa 6573 gtaagaggtgcagagaggag tccttctgga ctaaaagcca aaaaaagaaa gaaaaaaata 6633 atttttcttttctgttttat ttgctaatta aaaatgggga gggggaaagt cgtccctact 6693 tcctcctccgtgcttcctct cctcccctgt acgtgcccca gcattctggg gttatttaac 6753 aatagcaatagttctagtga atgtgtgaaa ccaagaaaca ctctgtactg tgtgcggacc 6813 cgcagtgacggccagtaaag tggacttaac tcccaagtgt gtcgcggccg gacaccgggc 6873 cctggacatgctgcttccat gttcagtccc ttccctgctt ctcgctgtct ttcttttccc 6933 acctcccaccccccagtttt cagattttct ctcatccaat aatgtaaaac tatcgtgtac 6993 gggttcctccctccttttct cttctcccaa atcttttccc ttcaaaggaa aaaaaaatgt 7053 tcagaggtccctgtcttctg tccccatctt cctgccgata gctatcccct gtatgatgtt 7113 ggatgctcctcacatgctga gtttccagcc ttttctgaaa ctcattagct ggggagaggg 7173 cagggaggcttcctgggcct tccaacctcc ttccccacct ccttcccaaa ccctcttggg 7233 aactcctcagggacaactac tgctgagttt gggtgcaccc aaagatggag gccaagtagc 7293 aatggggccggcctcacaga gagcgccgtg tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 7353 tccgccacacaacgcgctgt gtgtgtgtgt gtgtgtctca cacacacagc gcgcagtgtg 7413 tgtgtgtgtgtgtgtgtgta tgtgtgtgtg tgtgtgaccc tgtattgttt gataggatcc 7473 attcagtttccccaagtacc tgttttcatt cccatttttc ccattgttta aaaccatcac 7533 ttttttgtctttgggaaacc acaggaacaa tttctctgga gacaaggctg tgtctctctc 7593 ctggtcatttttgttccagc ctcttcagac tgtgcatctt ttcagcagga actccctctc 7653 ttctcggtagctttgaatct taagcttcta cgggagagtg gtagaactgg atcatttcat 7713 aatcccatttagttgtgctt ttcttcatat acttcatacc ccaggacccc ttccccagca 7773 gcagagaccctggagcacag gagagtaggg aggaggggtt ctgggtccat cactgcctta 7833 catgtgactatgtccaagtt aagcccccaa cacgagagga aagctgctga ctcccagcta 7893 tagccatgggcacttggccc cctgcttttc ctgctcagca gagcccctcc cttcagagat 7953 tacgggtacttgcactgggg aggtggctgc tggctggccc aagcagagag ctgaggcatc 8013 caagaaatgttccattgggg ggtgggggtg ccaggtgagg tggagcattc cttgtattct 8073 ggcagcactgaagagccact gaagggggta ggggcagtgt aggtcctggg gcagcccctt 8133 tattcctttatgccccttct ccctcatagc ctatttctaa agtcgccttt tctgtcagat 8193 aaacctcaaaacttttaatt ttatttgaga tttttttttg cttttaagag gtggattgaa 8253 gaatatttgaattgacttta tattatgcat aaatatttat attttatcta aataactgcg 8313 ctgtaacaaactttgtgtta gacagttgaa actgttagag ttgggggctc tgctttttcc 8373 ccctggcaattttcccttgg tataagatgt gctagattaa tttcattgtg aggtggatgg 8433 gggagtgaaattgtgaggtg gatgggggag tgaaatctcc atggttcctg ctttgtgttc 8493 ctctcccagctccatctctt tccctaggga ccaggcactc atatggcggg gtggggtcct 8553 agcctcagtttgaagaagtg ggggctggag cggggttggg ggtggtaggg atagggcatg 8613 atcaaaggggccatttcttg cttttctttc ctcatcttca ctgccccctt gagctaggtg 8673 gattttctcttcatgacaag agtatttggt agggaaagca ggtttaaaat aaaaagacaa 8733 cccaccccctgcccttttgc ttccctccca tcagtctggt tgacaggaag aaaccacacc 8793 atcaacaccaacaagtttnt gtgttccttt tacagcaaaa gggacttttt atataaccaa 8853 atgtggtgttttagtgactt tttgataatg tacagttttt tgtgaattta aatttatttc 8913 tttctatatttttaggacca atttcatttt taataaggtt aaaaagaaaa aaaaagtcta 8973 gcgaaaaaactcctgttttt gccatgtgat gttccacaag tgcagctgta gaaaagtgct 9033 tgtcagttgttgaataaaaa aaccacattt gatagagatt caaaagactc tgtgtattca 9093 tcttcccttctacacacctg agggggagac ggcgttggga taggtatgac tggcttaaga 9153 gaccacaggcaagggaacaa caggggctcc tgttccatac cctctgtgtg ggatggaaag 9213 ggtcattagtgctcccgcct aaatgtctgg ctgagttgct ggaagcaaag gggggattca 9273 gtgcatccaggtcctgcctt gtgagatgtg gccccagctt cctaagctgc cacctctgtg 9333 ttcctgtcatagcaaatatg ggaccatcac cagcttacca cttcccactc acggataaga 9393 caccaagacgtagac 9408 <210> SEQ ID NO 15 <211> LENGTH: 24 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 15 tgactctgaa gtaggcaaggctgg 24 <210> SEQ ID NO 16 <211> LENGTH: 22 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 16 cttgcctcac agattggcctgt 22 <210> SEQ ID NO 17 <211> LENGTH: 21 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 17 ttgccgtatt tggctggtatc 21 <210> SEQ ID NO 18 <211> LENGTH: 23 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 18 catagagaag agggcagggttga 23 <210> SEQ ID NO 19 <211> LENGTH: 20 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 19 cttttgagca agttcagcct20 <210> SEQ ID NO 20 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 20 gtcggcttct tcatttcctcca 22 <210> SEQ ID NO 21 <211> LENGTH: 1972 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 21 Met Gly Gln Thr Lys Ser ThrSer Ser Gly Gly Gly Asn Arg Lys Cys 1 5 10 15 Asn Gln Glu Gln Ser LysAsn Gln Pro Leu Asp Ala Arg Val Asp Lys 20 25 30 Ile Lys Asp Lys Lys ProArg Lys Lys Ala Met Glu Ser Ser Ser Asn 35 40 45 Ser Asp Ser Asp Ser GlyThr Ser Ser Asp Thr Ser Ser Glu Gly Ile 50 55 60 Ser Ser Ser Asp Ser AspAsp Leu Glu Glu Asp Glu Glu Glu Glu Asp 65 70 75 80 Gln Ser Ile Glu GluSer Glu Asp Asp Asp Ser Asp Ser Glu Ser Glu 85 90 95 Ala Gln His Lys SerAsn Asn Gln Val Leu Leu His Gly Ile Ser Asp 100 105 110 Pro Lys Ala AspGly Gln Lys Ala Thr Glu Lys Ala Gln Glu Lys Arg 115 120 125 Ile His GlnPro Leu Pro Leu Ala Phe Glu Ser Gln Thr His Ser Phe 130 135 140 Gln SerGln Gln Lys Gln Pro Gln Val Leu Ser Gln Gln Leu Pro Phe 145 150 155 160Ile Phe Gln Ser Ser Gln Ala Lys Glu Glu Ser Val Asn Lys His Thr 165 170175 Ser Val Ile Gln Ser Thr Gly Leu Val Ser Asn Val Lys Pro Leu Ser 180185 190 Leu Val Asn Gln Ala Lys Lys Glu Thr Tyr Met Lys Leu Ile Val Pro195 200 205 Ser Pro Asp Val Leu Lys Ala Gly Asn Lys Asn Thr Ser Glu GluSer 210 215 220 Ser Leu Leu Thr Ser Glu Leu Arg Ser Lys Arg Glu Gln TyrLys Gln 225 230 235 240 Ala Phe Pro Ser Gln Leu Lys Lys Gln Glu Ser SerLys Ser Leu Lys 245 250 255 Lys Val Ile Ala Ala Leu Ser Asn Pro Lys AlaThr Ser Ser Ser Pro 260 265 270 Ala His Pro Lys Gln Thr Leu Glu Asn AsnHis Pro Asn Pro Phe Leu 275 280 285 Thr Asn Ala Leu Leu Gly Asn His GlnPro Asn Gly Val Ile Gln Ser 290 295 300 Val Ile Gln Glu Ala Pro Leu AlaLeu Thr Thr Lys Thr Lys Met Gln 305 310 315 320 Ser Lys Ile Asn Glu AsnIle Ala Ala Ala Ser Ser Thr Pro Phe Ser 325 330 335 Ser Pro Val Asn LeuSer Thr Ser Gly Arg Arg Thr Pro Gly Asn Gln 340 345 350 Thr Pro Val MetPro Ser Ala Ser Pro Ile Leu His Ser Gln Gly Lys 355 360 365 Glu Lys AlaVal Ser Asn Asn Val Asn Pro Val Lys Thr Gln His His 370 375 380 Ser HisPro Ala Lys Ser Leu Val Glu Gln Phe Arg Gly Thr Asp Ser 385 390 395 400Asp Ile Pro Ser Ser Lys Asp Ser Glu Asp Ser Asn Glu Asp Glu Glu 405 410415 Glu Asp Asp Glu Glu Glu Asp Glu Glu Asp Asp Glu Asp Asp Glu Ser 420425 430 Asp Asp Ser Gln Ser Glu Ser Asp Ser Asn Ser Glu Ser Asp Thr Glu435 440 445 Gly Ser Glu Glu Glu Asp Asp Asp Asp Lys Asp Gln Asp Glu SerAsp 450 455 460 Ser Asp Thr Glu Gly Glu Lys Thr Ser Met Lys Leu Asn LysThr Thr 465 470 475 480 Ser Ser Val Lys Ser Pro Ser Met Ser Leu Thr GlyHis Ser Thr Pro 485 490 495 Arg Asn Leu His Ile Ala Lys Ala Pro Gly SerAla Pro Ala Ala Leu 500 505 510 Cys Ser Glu Ser Gln Ser Pro Ala Phe LeuGly Thr Ser Ser Ser Thr 515 520 525 Leu Thr Ser Ser Pro His Ser Gly ThrSer Lys Arg Arg Arg Val Thr 530 535 540 Asp Glu Arg Glu Leu Arg Ile ProLeu Glu Tyr Gly Trp Gln Arg Glu 545 550 555 560 Thr Arg Ile Arg Asn PheGly Gly Arg Leu Gln Gly Glu Val Ala Tyr 565 570 575 Tyr Ala Pro Cys GlyLys Lys Leu Arg Gln Tyr Pro Glu Val Ile Lys 580 585 590 Tyr Leu Ser ArgAsn Gly Ile Met Asp Ile Ser Arg Asp Asn Phe Ser 595 600 605 Phe Ser AlaLys Ile Arg Val Gly Asp Phe Tyr Glu Ala Arg Asp Gly 610 615 620 Pro GlnGlu Met Gln Trp Cys Leu Leu Lys Glu Glu Asp Val Ile Pro 625 630 635 640Arg Ile Arg Ala Met Glu Gly Arg Arg Gly Arg Pro Pro Asn Pro Asp 645 650655 Arg Gln Arg Ala Arg Glu Glu Ser Arg Met Arg Arg Arg Lys Gly Arg 660665 670 Pro Pro Asn Val Gly Asn Ala Glu Phe Leu Asp Asn Ala Asp Ala Lys675 680 685 Leu Leu Arg Lys Leu Gln Ala Gln Glu Ile Ala Arg Gln Ala AlaGln 690 695 700 Ile Lys Leu Leu Arg Lys Leu Gln Lys Gln Glu Gln Ala ArgVal Ala 705 710 715 720 Lys Glu Ala Lys Lys Gln Gln Ala Ile Met Ala AlaGlu Glu Lys Arg 725 730 735 Lys Gln Lys Glu Gln Ile Lys Ile Met Lys GlnGln Glu Lys Ile Lys 740 745 750 Arg Ile Gln Gln Ile Arg Met Glu Lys GluLeu Arg Ala Gln Gln Ile 755 760 765 Leu Glu Ala Lys Lys Lys Lys Lys GluGlu Ala Ala Asn Ala Lys Leu 770 775 780 Leu Glu Ala Glu Lys Arg Ile LysGlu Lys Glu Met Arg Arg Gln Gln 785 790 795 800 Ala Val Leu Leu Lys HisGln Glu Arg Glu Arg Arg Arg Gln His Met 805 810 815 Met Leu Met Lys AlaMet Glu Ala Arg Lys Lys Ala Glu Glu Lys Glu 820 825 830 Arg Leu Lys GlnGlu Lys Arg Asp Glu Lys Arg Leu Asn Lys Glu Arg 835 840 845 Lys Leu GluGln Arg Arg Leu Glu Leu Glu Met Ala Lys Glu Leu Lys 850 855 860 Lys ProAsn Glu Asp Met Cys Leu Ala Asp Gln Lys Pro Leu Pro Glu 865 870 875 880Leu Pro Arg Ile Pro Gly Leu Val Leu Ser Gly Ser Thr Phe Ser Asp 885 890895 Cys Leu Met Val Val Gln Phe Leu Arg Asn Phe Gly Lys Val Leu Gly 900905 910 Phe Asp Val Asn Ile Asp Val Pro Asn Leu Ser Val Leu Gln Glu Gly915 920 925 Leu Leu Asn Ile Gly Asp Ser Met Gly Glu Val Gln Asp Leu LeuVal 930 935 940 Arg Leu Leu Ser Ala Ala Val Cys Asp Pro Gly Leu Ile ThrGly Tyr 945 950 955 960 Lys Ala Lys Thr Ala Leu Gly Glu His Leu Leu AsnVal Gly Val Asn 965 970 975 Arg Asp Asn Val Ser Glu Ile Leu Gln Ile PheMet Glu Ala His Cys 980 985 990 Gly Gln Thr Glu Leu Thr Glu Ser Leu LysThr Lys Ala Phe Gln Ala 995 1000 1005 His Thr Pro Ala Gln Lys Ala SerVal Leu Ala Phe Leu Ile Asn Glu 1010 1015 1020 Leu Ala Cys Ser Lys SerVal Val Ser Glu Ile Asp Lys Asn Ile Asp 1025 1030 1035 1040 Tyr Met SerAsn Leu Arg Arg Asp Lys Trp Val Val Glu Gly Lys Leu 1045 1050 1055 ArgLys Leu Arg Ile Ile His Ala Lys Lys Thr Gly Lys Arg Asp Thr 1060 10651070 Ser Gly Gly Ile Asp Leu Gly Glu Glu Gln His Pro Leu Gly Thr Pro1075 1080 1085 Thr Pro Gly Arg Lys Arg Arg Arg Lys Gly Gly Asp Ser AspTyr Asp 1090 1095 1100 Asp Asp Asp Asp Asp Asp Ser Asp Asp Gln Gly AspGlu Asp Asp Glu 1105 1110 1115 1120 Asp Glu Glu Asp Lys Glu Asp Gln LysGly Lys Lys Thr Asp Ile Cys 1125 1130 1135 Glu Asp Glu Asp Glu Gly AspGln Ala Ala Ser Val Glu Glu Leu Glu 1140 1145 1150 Lys Gln Ile Glu LysLeu Ser Lys Gln Gln Ser Gln Tyr Arg Arg Lys 1155 1160 1165 Leu Phe AspAla Ser His Ser Leu Arg Ser Val Met Phe Gly Pro Asp 1170 1175 1180 ArgTyr Arg Arg Arg Tyr Trp Ile Leu Pro Arg Cys Gly Gly Ile Phe 1185 11901195 1200 Val Glu Gly Met Glu Ser Gly Glu Gly Leu Glu Glu Ile Ala LysGlu 1205 1210 1215 Arg Glu Lys Leu Lys Lys Ala Glu Ser Val Gln Ile LysGlu Glu Met 1220 1225 1230 Phe Glu Thr Ser Gly Asp Ser Leu Asn Cys SerAsn Thr Asp His Cys 1235 1240 1245 Glu Gln Lys Glu Asp Leu Lys Glu LysAsp Asn Thr Asn Leu Phe Leu 1250 1255 1260 Gln Lys Pro Gly Ser Phe SerLys Leu Ser Lys Leu Leu Glu Val Ala 1265 1270 1275 1280 Lys Met Pro ProGlu Ser Glu Val Met Thr Pro Lys Pro Asn Ala Gly 1285 1290 1295 Ala AsnGly Cys Thr Leu Ser Tyr Gln Asn Ser Gly Lys His Ser Leu 1300 1305 1310Gly Ser Val Gln Ser Thr Ala Thr Gln Ser Asn Val Glu Lys Ala Asp 13151320 1325 Ser Asn Asn Leu Phe Asn Thr Gly Ser Ser Gly Pro Gly Lys PheTyr 1330 1335 1340 Ser Pro Leu Pro Asn Asp Gln Leu Leu Lys Thr Leu ThrGlu Lys Asn 1345 1350 1355 1360 Arg Gln Trp Phe Ser Leu Leu Pro Arg ThrPro Cys Asp Asp Thr Ser 1365 1370 1375 Leu Thr His Ala Asp Met Ser ThrAla Ser Leu Val Thr Pro Gln Ser 1380 1385 1390 Gln Pro Pro Ser Lys SerPro Ser Pro Thr Pro Ala Pro Leu Gly Ser 1395 1400 1405 Ser Ala Gln AsnPro Val Gly Leu Asn Pro Phe Ala Leu Ser Pro Leu 1410 1415 1420 Gln ValLys Gly Gly Val Ser Met Met Gly Leu Gln Phe Cys Gly Trp 1425 1430 14351440 Pro Thr Gly Val Val Thr Ser Asn Ile Pro Phe Thr Leu Ser Val Pro1445 1450 1455 Ser Leu Gly Ser Gly Leu Gly Leu Ser Glu Gly Asn Gly AsnSer Phe 1460 1465 1470 Leu Thr Ser Asn Val Ala Ser Ser Lys Ser Glu SerPro Val Pro Gln 1475 1480 1485 Asn Glu Lys Ala Thr Ser Ala Gln Pro AlaAla Val Glu Val Ala Lys 1490 1495 1500 Pro Val Asp Phe Pro Ser Pro LysPro Ile Pro Glu Glu Met Gln Phe 1505 1510 1515 1520 Gly Trp Trp Arg IleIle Asp Pro Glu Asp Leu Lys Ala Leu Leu Lys 1525 1530 1535 Val Leu HisLeu Arg Gly Ile Arg Glu Lys Ala Leu Gln Lys Gln Ile 1540 1545 1550 GlnLys His Leu Asp Tyr Ile Thr Gln Ala Cys Leu Lys Asn Lys Asp 1555 15601565 Val Ala Ile Ile Glu Leu Asn Glu Asn Glu Glu Asn Gln Val Thr Arg1570 1575 1580 Asp Ile Val Glu Asn Trp Ser Val Glu Glu Gln Ala Met GluMet Asp 1585 1590 1595 1600 Leu Ser Val Leu Gln Gln Val Glu Asp Leu GluArg Arg Val Ala Ser 1605 1610 1615 Ala Ser Leu Gln Val Lys Gly Trp MetCys Pro Glu Pro Ala Ser Glu 1620 1625 1630 Arg Glu Asp Leu Val Tyr PheGlu His Lys Ser Phe Thr Lys Leu Cys 1635 1640 1645 Lys Glu His Asp GlyGlu Phe Thr Gly Glu Asp Glu Ser Ser Ala His 1650 1655 1660 Ala Leu GluArg Lys Ser Asp Asn Pro Leu Asp Ile Ala Val Thr Arg 1665 1670 1675 1680Leu Ala Asp Leu Glu Arg Asn Ile Glu Arg Arg Ile Glu Glu Asp Ile 16851690 1695 Ala Pro Gly Leu Arg Val Trp Arg Arg Ala Leu Ser Glu Ala ArgSer 1700 1705 1710 Ala Ala Gln Val Ala Leu Cys Ile Gln Gln Leu Gln LysSer Ile Ala 1715 1720 1725 Trp Glu Lys Ser Ile Met Lys Val Tyr Cys GlnIle Cys Arg Lys Gly 1730 1735 1740 Asp Asn Glu Glu Leu Leu Leu Leu CysAsp Gly Cys Asp Lys Gly Cys 1745 1750 1755 1760 His Thr Tyr Cys His ArgPro Lys Ile Thr Thr Ile Pro Asp Gly Asp 1765 1770 1775 Trp Phe Cys ProAla Cys Ile Ala Lys Ala Ser Gly Gln Thr Leu Lys 1780 1785 1790 Ile LysLys Leu His Val Lys Gly Lys Lys Thr Asn Glu Ser Lys Lys 1795 1800 1805Gly Lys Lys Val Thr Leu Thr Gly Asp Thr Glu Asp Glu Asp Ser Ala 18101815 1820 Ser Thr Ser Ser Ser Leu Lys Arg Gly Asn Lys Asp Leu Gln LysArg 1825 1830 1835 1840 Lys Met Glu Glu Asn Thr Ser Ile Asn Leu Ser LysGln Glu Ser Phe 1845 1850 1855 Thr Ser Val Lys Lys Pro Lys Arg Asp AspSer Lys Asp Leu Ala Leu 1860 1865 1870 Cys Ser Met Ile Leu Thr Glu MetGlu Thr His Glu Asp Ala Trp Pro 1875 1880 1885 Phe Leu Leu Pro Val AsnLeu Lys Leu Val Pro Gly Tyr Lys Lys Val 1890 1895 1900 Ile Lys Lys ProMet Asp Phe Ser Thr Ile Arg Glu Lys Leu Ser Ser 1905 1910 1915 1920 GlyGln Tyr Pro Asn Leu Glu Thr Phe Ala Leu Asp Val Arg Leu Val 1925 19301935 Phe Asp Asn Cys Glu Thr Phe Asn Glu Asp Asp Ser Asp Ile Gly Arg1940 1945 1950 Ala Gly His Asn Met Arg Lys Tyr Phe Glu Lys Lys Trp ThrAsp Thr 1955 1960 1965 Phe Lys Val Ser 1970 <210> SEQ ID NO 22 <211>LENGTH: 7585 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE:<221> NAME/KEY: CDS <222> LOCATION: (367)...(6282) <400> SEQUENCE: 22ggtctcgatc tcctgacctt gtgatccacc tcctcggcct cccaaagtgc tgggattaca 60ggcatgagcc acggcaccca gcctcatttg ctgttaaact catttattga gtcacctttt 120tcttcctcac actttttagt cttagaattt ttgtgtgttt ttatttaccc taacctgtca 180atttcatagt ttccactttc ttgttgaagt ttccaaactt gacctcatgc ctttgaatat 240actaattcta ttgctttgac acattttttt cccgaaaaag gtgtaaatgg gtcaataaat 300ggaagtaata catcatctgt aattggtatc aacacatctg tactatccac tactgcttca 360agttcc atg gga caa act aaa agt aca agc tca ggt gga gga aat cga 408 MetGly Gln Thr Lys Ser Thr Ser Ser Gly Gly Gly Asn Arg 1 5 10 aaa tgt aatcag gaa caa agc aaa aac cag cct ttg gat gct aga gtt 456 Lys Cys Asn GlnGlu Gln Ser Lys Asn Gln Pro Leu Asp Ala Arg Val 15 20 25 30 gac aaa atcaaa gat aag aaa cca agg aag aaa gca atg gaa agt tct 504 Asp Lys Ile LysAsp Lys Lys Pro Arg Lys Lys Ala Met Glu Ser Ser 35 40 45 agc aac agt gatagt gat tca ggc aca tca tca gac acc tca agt gaa 552 Ser Asn Ser Asp SerAsp Ser Gly Thr Ser Ser Asp Thr Ser Ser Glu 50 55 60 ggc att agt agc agtgat tca gat gat cta gaa gaa gat gaa gaa gaa 600 Gly Ile Ser Ser Ser AspSer Asp Asp Leu Glu Glu Asp Glu Glu Glu 65 70 75 gaa gat caa agt att gaagaa agt gaa gat gat gat tct gat tca gag 648 Glu Asp Gln Ser Ile Glu GluSer Glu Asp Asp Asp Ser Asp Ser Glu 80 85 90 agt gaa gca caa cat aaa agtaac aac cag gtg cta tta cat ggt att 696 Ser Glu Ala Gln His Lys Ser AsnAsn Gln Val Leu Leu His Gly Ile 95 100 105 110 tca gac cca aaa gca gatgga cag aaa gca act gaa aaa gcc cag gaa 744 Ser Asp Pro Lys Ala Asp GlyGln Lys Ala Thr Glu Lys Ala Gln Glu 115 120 125 aaa aga ata cac cag ccatta cct ctt gcg ttt gaa tcc cag act cac 792 Lys Arg Ile His Gln Pro LeuPro Leu Ala Phe Glu Ser Gln Thr His 130 135 140 tca ttc caa tcc cag cagaag cag cct cag gtt ttg tca cag cag ctt 840 Ser Phe Gln Ser Gln Gln LysGln Pro Gln Val Leu Ser Gln Gln Leu 145 150 155 cca ttt att ttc caa agctct cag gca aag gag gaa tct gtg aac aaa 888 Pro Phe Ile Phe Gln Ser SerGln Ala Lys Glu Glu Ser Val Asn Lys 160 165 170 cac acc agt gta ata cagtct acg gga ttg gtg tcc aat gtg aaa cct 936 His Thr Ser Val Ile Gln SerThr Gly Leu Val Ser Asn Val Lys Pro 175 180 185 190 tta tct ttg gta aatcaa gcc aaa aag gaa act tac atg aaa ctc ata 984 Leu Ser Leu Val Asn GlnAla Lys Lys Glu Thr Tyr Met Lys Leu Ile 195 200 205 gtt cct tct cct gatgtt ctt aaa gca ggg aat aaa aat acc tct gaa 1032 Val Pro Ser Pro Asp ValLeu Lys Ala Gly Asn Lys Asn Thr Ser Glu 210 215 220 gaa tct agt tta ttgacc agt gaa ttg aga tcc aaa cgg gaa caa tat 1080 Glu Ser Ser Leu Leu ThrSer Glu Leu Arg Ser Lys Arg Glu Gln Tyr 225 230 235 aaa cag gca ttc ccatca cag tta aag aaa caa gag tca tcg aag agc 1128 Lys Gln Ala Phe Pro SerGln Leu Lys Lys Gln Glu Ser Ser Lys Ser 240 245 250 ctg aag aag gtt attgca gct ttg tca aat cca aaa gca acc tct agt 1176 Leu Lys Lys Val Ile AlaAla Leu Ser Asn Pro Lys Ala Thr Ser Ser 255 260 265 270 tca cca gca catcca aaa caa aca tta gaa aac aac cac cca aat cca 1224 Ser Pro Ala His ProLys Gln Thr Leu Glu Asn Asn His Pro Asn Pro 275 280 285 ttc ttg aca aatgca ctt tta ggt aat cac caa cca aat gga gtt att 1272 Phe Leu Thr Asn AlaLeu Leu Gly Asn His Gln Pro Asn Gly Val Ile 290 295 300 caa agt gtc attcaa gaa gct cct cta gca ctt act acc aaa act aaa 1320 Gln Ser Val Ile GlnGlu Ala Pro Leu Ala Leu Thr Thr Lys Thr Lys 305 310 315 atg cag agc aagatt aat gaa aac att gct gct gca agt agc acc cct 1368 Met Gln Ser Lys IleAsn Glu Asn Ile Ala Ala Ala Ser Ser Thr Pro 320 325 330 ttt tcc tca cctgta aat ctg agt aca agt ggg aga aga acc cct ggc 1416 Phe Ser Ser Pro ValAsn Leu Ser Thr Ser Gly Arg Arg Thr Pro Gly 335 340 345 350 aat cag acacct gta atg cct tct gcc tct ccc atc ctg cat agt caa 1464 Asn Gln Thr ProVal Met Pro Ser Ala Ser Pro Ile Leu His Ser Gln 355 360 365 ggg aag gaaaaa gca gtt agc aat aat gta aac cca gta aaa aca cag 1512 Gly Lys Glu LysAla Val Ser Asn Asn Val Asn Pro Val Lys Thr Gln 370 375 380 cat cac tcccat cct gca aaa tct tta gtg gaa caa ttc aga gga aca 1560 His His Ser HisPro Ala Lys Ser Leu Val Glu Gln Phe Arg Gly Thr 385 390 395 gat tca gacatt ccc agt agt aaa gat tct gaa gat tca aat gag gat 1608 Asp Ser Asp IlePro Ser Ser Lys Asp Ser Glu Asp Ser Asn Glu Asp 400 405 410 gaa gag gaagat gat gaa gaa gaa gat gag gaa gat gat gaa gat gat 1656 Glu Glu Glu AspAsp Glu Glu Glu Asp Glu Glu Asp Asp Glu Asp Asp 415 420 425 430 gaa tctgat gac agc caa tca gaa tca gat agt aat tca gaa tca gat 1704 Glu Ser AspAsp Ser Gln Ser Glu Ser Asp Ser Asn Ser Glu Ser Asp 435 440 445 aca gaagga tca gaa gaa gaa gat gat gat gat aaa gac caa gat gaa 1752 Thr Glu GlySer Glu Glu Glu Asp Asp Asp Asp Lys Asp Gln Asp Glu 450 455 460 tca gatagt gat act gaa gga gag aaa act tca atg aaa ctg aat aaa 1800 Ser Asp SerAsp Thr Glu Gly Glu Lys Thr Ser Met Lys Leu Asn Lys 465 470 475 aca acttcc tct gtc aaa agc cct tcc atg agt ctc aca ggt cac tca 1848 Thr Thr SerSer Val Lys Ser Pro Ser Met Ser Leu Thr Gly His Ser 480 485 490 aca cctcgt aac ctc cac ata gca aaa gcc cca ggc tct gct cct gct 1896 Thr Pro ArgAsn Leu His Ile Ala Lys Ala Pro Gly Ser Ala Pro Ala 495 500 505 510 gcctta tgt tct gaa tcc cag tca cct gct ttt ctt ggt aca tct tct 1944 Ala LeuCys Ser Glu Ser Gln Ser Pro Ala Phe Leu Gly Thr Ser Ser 515 520 525 tccaca ctt act tca agc cca cac tct ggc act tcc aaa aga aga aga 1992 Ser ThrLeu Thr Ser Ser Pro His Ser Gly Thr Ser Lys Arg Arg Arg 530 535 540 gtaaca gat gaa cgt gaa ctg cgt att cca ttg gaa tat ggc tgg cag 2040 Val ThrAsp Glu Arg Glu Leu Arg Ile Pro Leu Glu Tyr Gly Trp Gln 545 550 555 agagag aca aga ata aga aac ttt gga ggg cgc ctt caa gga gaa gta 2088 Arg GluThr Arg Ile Arg Asn Phe Gly Gly Arg Leu Gln Gly Glu Val 560 565 570 gcatat tat gct cca tgt gga aag aaa ctt agg cag tac cct gaa gta 2136 Ala TyrTyr Ala Pro Cys Gly Lys Lys Leu Arg Gln Tyr Pro Glu Val 575 580 585 590ata aag tat ctc agc aga aat gga ata atg gat atc tca agg gac aat 2184 IleLys Tyr Leu Ser Arg Asn Gly Ile Met Asp Ile Ser Arg Asp Asn 595 600 605ttc agc ttc agt gca aaa ata aga gtg ggt gac ttc tat gaa gcc aga 2232 PheSer Phe Ser Ala Lys Ile Arg Val Gly Asp Phe Tyr Glu Ala Arg 610 615 620gat gga ccg cag gaa atg cag tgg tgt ctt ttg aaa gaa gag gat gtc 2280 AspGly Pro Gln Glu Met Gln Trp Cys Leu Leu Lys Glu Glu Asp Val 625 630 635att cct cgt atc agg gca atg gaa ggt cgt aga gga aga cca cca aat 2328 IlePro Arg Ile Arg Ala Met Glu Gly Arg Arg Gly Arg Pro Pro Asn 640 645 650cca gat aga caa cga gca aga gag gaa tcc agg atg aga cgt cgg aaa 2376 ProAsp Arg Gln Arg Ala Arg Glu Glu Ser Arg Met Arg Arg Arg Lys 655 660 665670 ggt cga cct cca aat gtt ggc aat gct gaa ttc cta gat aac gca gat 2424Gly Arg Pro Pro Asn Val Gly Asn Ala Glu Phe Leu Asp Asn Ala Asp 675 680685 gca aag ttg cta aga aaa ctg caa gct caa gaa ata gcc agg caa gca 2472Ala Lys Leu Leu Arg Lys Leu Gln Ala Gln Glu Ile Ala Arg Gln Ala 690 695700 gca caa ata aag ctt ttg aga aaa ctt caa aag cag gaa cag gct cgg 2520Ala Gln Ile Lys Leu Leu Arg Lys Leu Gln Lys Gln Glu Gln Ala Arg 705 710715 gtt gct aaa gaa gcc aaa aaa caa caa gca ata atg gct gct gag gag 2568Val Ala Lys Glu Ala Lys Lys Gln Gln Ala Ile Met Ala Ala Glu Glu 720 725730 aag cgg aag caa aaa gaa cag ata aag att atg aaa cag cag gaa aaa 2616Lys Arg Lys Gln Lys Glu Gln Ile Lys Ile Met Lys Gln Gln Glu Lys 735 740745 750 att aag aga ata cag caa atc aga atg gaa aaa gaa ctt cga gct cag2664 Ile Lys Arg Ile Gln Gln Ile Arg Met Glu Lys Glu Leu Arg Ala Gln 755760 765 caa att cta gag gct aaa aag aaa aag aag gaa gaa gcg gca aat gcc2712 Gln Ile Leu Glu Ala Lys Lys Lys Lys Lys Glu Glu Ala Ala Asn Ala 770775 780 aaa tta ttg gag gcc gag aaa cga ata aag gaa aaa gaa atg aga aga2760 Lys Leu Leu Glu Ala Glu Lys Arg Ile Lys Glu Lys Glu Met Arg Arg 785790 795 caa caa gct gtt ctt ctg aaa cat cag gaa cga gag cga agg cga caa2808 Gln Gln Ala Val Leu Leu Lys His Gln Glu Arg Glu Arg Arg Arg Gln 800805 810 cac atg atg ctt atg aaa gct atg gaa gct cgt aaa aaa gca gaa gaa2856 His Met Met Leu Met Lys Ala Met Glu Ala Arg Lys Lys Ala Glu Glu 815820 825 830 aaa gag cgg ttg aaa caa gaa aaa cgt gat gag aaa aga tta aataaa 2904 Lys Glu Arg Leu Lys Gln Glu Lys Arg Asp Glu Lys Arg Leu Asn Lys835 840 845 gag cgt aaa cta gag cag cga aga tta gaa tta gaa atg gca aaggaa 2952 Glu Arg Lys Leu Glu Gln Arg Arg Leu Glu Leu Glu Met Ala Lys Glu850 855 860 cta aag aag cct aat gaa gac atg tgc tta gca gac caa aag cctttg 3000 Leu Lys Lys Pro Asn Glu Asp Met Cys Leu Ala Asp Gln Lys Pro Leu865 870 875 cca gag ttg cct cgt att cca gga ctt gtt ctc tct gga agt acattt 3048 Pro Glu Leu Pro Arg Ile Pro Gly Leu Val Leu Ser Gly Ser Thr Phe880 885 890 tca gac tgt ctc atg gtg gtg cag ttc tta cga aac ttt ggt aaagtt 3096 Ser Asp Cys Leu Met Val Val Gln Phe Leu Arg Asn Phe Gly Lys Val895 900 905 910 ttg ggc ttt gat gtg aat att gat gtt cca aac ctg agt gttctt caa 3144 Leu Gly Phe Asp Val Asn Ile Asp Val Pro Asn Leu Ser Val LeuGln 915 920 925 gag gga ttg cta aat ata ggg gac agc atg ggt gaa gta caagac ttg 3192 Glu Gly Leu Leu Asn Ile Gly Asp Ser Met Gly Glu Val Gln AspLeu 930 935 940 ctt gtg agg ctc ctc tca gct gct gta tgt gat cca ggt ctaata aca 3240 Leu Val Arg Leu Leu Ser Ala Ala Val Cys Asp Pro Gly Leu IleThr 945 950 955 gga tac aag gct aaa aca gct ctt gga gaa cat ttg ctg aatgtt ggt 3288 Gly Tyr Lys Ala Lys Thr Ala Leu Gly Glu His Leu Leu Asn ValGly 960 965 970 gtg aat cga gac aat gtt tcc gag att tta cag ata ttt atggaa gcc 3336 Val Asn Arg Asp Asn Val Ser Glu Ile Leu Gln Ile Phe Met GluAla 975 980 985 990 cac tgt gga caa act gag ctt act gaa agt ctg aag accaaa gct ttt 3384 His Cys Gly Gln Thr Glu Leu Thr Glu Ser Leu Lys Thr LysAla Phe 995 1000 1005 cag gct cac act cca gca cag aaa gct tca gtc ctggct ttc ctg atc 3432 Gln Ala His Thr Pro Ala Gln Lys Ala Ser Val Leu AlaPhe Leu Ile 1010 1015 1020 aat gaa ctg gca tgc agc aag agt gtg gtc agtgaa atc gac aag aac 3480 Asn Glu Leu Ala Cys Ser Lys Ser Val Val Ser GluIle Asp Lys Asn 1025 1030 1035 att gat tat atg tca aac ttg agg aga gataaa tgg gtg gta gaa ggt 3528 Ile Asp Tyr Met Ser Asn Leu Arg Arg Asp LysTrp Val Val Glu Gly 1040 1045 1050 aaa ctc cgc aag ctc aga atc att catgct aag aaa aca ggc aaa aga 3576 Lys Leu Arg Lys Leu Arg Ile Ile His AlaLys Lys Thr Gly Lys Arg 1055 1060 1065 1070 gac act tca ggt ggc att gatctg gga gaa gag cag cat ccc ttg ggc 3624 Asp Thr Ser Gly Gly Ile Asp LeuGly Glu Glu Gln His Pro Leu Gly 1075 1080 1085 aca ccc act cca gga cgcaag cga aga agg aag gga gga gac agt gat 3672 Thr Pro Thr Pro Gly Arg LysArg Arg Arg Lys Gly Gly Asp Ser Asp 1090 1095 1100 tat gac gat gat gatgac gat gac agt gat gac caa ggg gat gaa gat 3720 Tyr Asp Asp Asp Asp AspAsp Asp Ser Asp Asp Gln Gly Asp Glu Asp 1105 1110 1115 gat gag gat gaagaa gat aag gaa gac caa aaa gga aaa aag act gat 3768 Asp Glu Asp Glu GluAsp Lys Glu Asp Gln Lys Gly Lys Lys Thr Asp 1120 1125 1130 atc tgt gaagat gag gat gaa ggt gac caa gca gca agt gtt gaa gag 3816 Ile Cys Glu AspGlu Asp Glu Gly Asp Gln Ala Ala Ser Val Glu Glu 1135 1140 1145 1150 ctggaa aaa cag att gaa aaa ctg agt aaa caa cag agt cag tac aga 3864 Leu GluLys Gln Ile Glu Lys Leu Ser Lys Gln Gln Ser Gln Tyr Arg 1155 1160 1165agg aag ctc ttt gat gcg tct cac tca ttg cgt tca gtg atg ttt ggc 3912 ArgLys Leu Phe Asp Ala Ser His Ser Leu Arg Ser Val Met Phe Gly 1170 11751180 cca gat cgt tac aga cgc cgg tac tgg att ctt ccc cga tgt ggg ggg3960 Pro Asp Arg Tyr Arg Arg Arg Tyr Trp Ile Leu Pro Arg Cys Gly Gly1185 1190 1195 att ttt gta gaa ggc atg gag agt ggt gaa gga cta gaa gaaatt gca 4008 Ile Phe Val Glu Gly Met Glu Ser Gly Glu Gly Leu Glu Glu IleAla 1200 1205 1210 aaa gaa aga gaa aaa ctg aaa aag gca gaa agt gtc cagatc aaa gaa 4056 Lys Glu Arg Glu Lys Leu Lys Lys Ala Glu Ser Val Gln IleLys Glu 1215 1220 1225 1230 gaa atg ttt gag act tct ggg gac agt tta aattgt tca aat aca gat 4104 Glu Met Phe Glu Thr Ser Gly Asp Ser Leu Asn CysSer Asn Thr Asp 1235 1240 1245 cac tgt gaa caa aag gaa gat ctt aaa gaaaaa gat aac aca aat cta 4152 His Cys Glu Gln Lys Glu Asp Leu Lys Glu LysAsp Asn Thr Asn Leu 1250 1255 1260 ttc ctt cag aaa cct ggc tct ttt tccaaa tta agc aag ctt ttg gaa 4200 Phe Leu Gln Lys Pro Gly Ser Phe Ser LysLeu Ser Lys Leu Leu Glu 1265 1270 1275 gta gct aag atg cct cct gag tcagag gtt atg acc ccc aaa cca aat 4248 Val Ala Lys Met Pro Pro Glu Ser GluVal Met Thr Pro Lys Pro Asn 1280 1285 1290 gct ggt gca aat ggg tgc acgttg tct tat cag aac agt gga aaa cat 4296 Ala Gly Ala Asn Gly Cys Thr LeuSer Tyr Gln Asn Ser Gly Lys His 1295 1300 1305 1310 tca ctg ggc agc gttcag tca aca gca acg caa agc aat gtg gaa aag 4344 Ser Leu Gly Ser Val GlnSer Thr Ala Thr Gln Ser Asn Val Glu Lys 1315 1320 1325 gca gac tct aataat ctg ttt aat act ggt tca agt ggt cca ggg aag 4392 Ala Asp Ser Asn AsnLeu Phe Asn Thr Gly Ser Ser Gly Pro Gly Lys 1330 1335 1340 ttc tac agtcct ctc ccc aat gac cag tta cta aaa acg ctg act gaa 4440 Phe Tyr Ser ProLeu Pro Asn Asp Gln Leu Leu Lys Thr Leu Thr Glu 1345 1350 1355 aag aataga caa tgg ttt agt ctt ttg cca cga aca ccc tgt gat gac 4488 Lys Asn ArgGln Trp Phe Ser Leu Leu Pro Arg Thr Pro Cys Asp Asp 1360 1365 1370 acttca ctt act cat gcc gat atg tca act gct tct ttg gtg act cct 4536 Thr SerLeu Thr His Ala Asp Met Ser Thr Ala Ser Leu Val Thr Pro 1375 1380 13851390 cag tct cag cca cca tct aag tca cct tca cct acc cca gct cct ctt4584 Gln Ser Gln Pro Pro Ser Lys Ser Pro Ser Pro Thr Pro Ala Pro Leu1395 1400 1405 gga tct tct gct cag aat cct gtt ggc tta aat cca ttt gcttta tca 4632 Gly Ser Ser Ala Gln Asn Pro Val Gly Leu Asn Pro Phe Ala LeuSer 1410 1415 1420 cct ctt cag gtg aaa ggt gga gta tct atg atg gga cttcag ttt tgt 4680 Pro Leu Gln Val Lys Gly Gly Val Ser Met Met Gly Leu GlnPhe Cys 1425 1430 1435 gga tgg ccc act ggt gtg gtt act tct aat att ccattt aca tta tct 4728 Gly Trp Pro Thr Gly Val Val Thr Ser Asn Ile Pro PheThr Leu Ser 1440 1445 1450 gta cct agt cta gga tcg ggg tta ggg tta tcagaa gga aat ggt aat 4776 Val Pro Ser Leu Gly Ser Gly Leu Gly Leu Ser GluGly Asn Gly Asn 1455 1460 1465 1470 tca ttc ttg act tcc aat gtt gct tcaagt aaa agt gaa tct cca gta 4824 Ser Phe Leu Thr Ser Asn Val Ala Ser SerLys Ser Glu Ser Pro Val 1475 1480 1485 cca cag aat gaa aag gcc act tcagct caa cct gca gct gtt gaa gta 4872 Pro Gln Asn Glu Lys Ala Thr Ser AlaGln Pro Ala Ala Val Glu Val 1490 1495 1500 gca aaa cca gta gat ttt cctagt cca aaa cct att cca gaa gaa atg 4920 Ala Lys Pro Val Asp Phe Pro SerPro Lys Pro Ile Pro Glu Glu Met 1505 1510 1515 cag ttt ggt tgg tgg agaatt att gac cca gag gac cta aaa gct ttg 4968 Gln Phe Gly Trp Trp Arg IleIle Asp Pro Glu Asp Leu Lys Ala Leu 1520 1525 1530 ctc aaa gtg ctg catctc aga gga ata aga gaa aag gca tta caa aaa 5016 Leu Lys Val Leu His LeuArg Gly Ile Arg Glu Lys Ala Leu Gln Lys 1535 1540 1545 1550 caa att cagaaa cat ttg gat tat att act caa gcc tgc ctc aag aat 5064 Gln Ile Gln LysHis Leu Asp Tyr Ile Thr Gln Ala Cys Leu Lys Asn 1555 1560 1565 aag gatgtt gct att att gaa ctg aat gaa aat gaa gaa aac caa gta 5112 Lys Asp ValAla Ile Ile Glu Leu Asn Glu Asn Glu Glu Asn Gln Val 1570 1575 1580 actcga gat att gtg gag aac tgg tca gta gaa gaa caa gca atg gaa 5160 Thr ArgAsp Ile Val Glu Asn Trp Ser Val Glu Glu Gln Ala Met Glu 1585 1590 1595atg gat ttg agt gtc ctt caa cag gta gaa gat cta gaa agg aga gtt 5208 MetAsp Leu Ser Val Leu Gln Gln Val Glu Asp Leu Glu Arg Arg Val 1600 16051610 gca tcc gca agt ttg caa gtg aag ggt tgg atg tgt cca gag cct gca5256 Ala Ser Ala Ser Leu Gln Val Lys Gly Trp Met Cys Pro Glu Pro Ala1615 1620 1625 1630 tca gaa agg gag gac ttg gta tat ttt gaa cat aaa tcattt act aaa 5304 Ser Glu Arg Glu Asp Leu Val Tyr Phe Glu His Lys Ser PheThr Lys 1635 1640 1645 ttg tgc aag gag cat gat gga gaa ttt act ggc gaagac gaa agc agt 5352 Leu Cys Lys Glu His Asp Gly Glu Phe Thr Gly Glu AspGlu Ser Ser 1650 1655 1660 gca cat gca cta gaa cgg aag agt gac aac ccccta gat ata gct gtt 5400 Ala His Ala Leu Glu Arg Lys Ser Asp Asn Pro LeuAsp Ile Ala Val 1665 1670 1675 acc agg ctg gct gat ttg gag cgg aac attgaa aga aga att gag gaa 5448 Thr Arg Leu Ala Asp Leu Glu Arg Asn Ile GluArg Arg Ile Glu Glu 1680 1685 1690 gat att gct cca ggg ctc agg gtg tggaga agg gca tta tca gaa gct 5496 Asp Ile Ala Pro Gly Leu Arg Val Trp ArgArg Ala Leu Ser Glu Ala 1695 1700 1705 1710 cgc agt gct gca cag gta gctctg tgc att cag caa tta cag aaa tca 5544 Arg Ser Ala Ala Gln Val Ala LeuCys Ile Gln Gln Leu Gln Lys Ser 1715 1720 1725 ata gca tgg gaa aaa tcaatt atg aaa gtt tac tgc caa atc tgt cga 5592 Ile Ala Trp Glu Lys Ser IleMet Lys Val Tyr Cys Gln Ile Cys Arg 1730 1735 1740 aag gga gat aat gaagaa ctg ctt ctt ctt tgt gat ggc tgt gac aaa 5640 Lys Gly Asp Asn Glu GluLeu Leu Leu Leu Cys Asp Gly Cys Asp Lys 1745 1750 1755 ggc tgt cat acctac tgc cat aga ccc aag att aca aca atc cca gat 5688 Gly Cys His Thr TyrCys His Arg Pro Lys Ile Thr Thr Ile Pro Asp 1760 1765 1770 gga gac tggttt tgt cca gct tgc att gct aag gca agt ggt caa act 5736 Gly Asp Trp PheCys Pro Ala Cys Ile Ala Lys Ala Ser Gly Gln Thr 1775 1780 1785 1790 ctaaaa atc aaa aaa ctt cat gtc aaa gga aaa aag act aat gag tct 5784 Leu LysIle Lys Lys Leu His Val Lys Gly Lys Lys Thr Asn Glu Ser 1795 1800 1805aag aaa ggc aag aag gta act tta aca gga gat act gaa gat gaa gac 5832 LysLys Gly Lys Lys Val Thr Leu Thr Gly Asp Thr Glu Asp Glu Asp 1810 18151820 tct gca tct aca agt agt tca cta aaa aga gga aac aaa gac ctc cag5880 Ser Ala Ser Thr Ser Ser Ser Leu Lys Arg Gly Asn Lys Asp Leu Gln1825 1830 1835 aaa aga aaa atg gag gaa aac act tct att aac ttg tca aaacaa gaa 5928 Lys Arg Lys Met Glu Glu Asn Thr Ser Ile Asn Leu Ser Lys GlnGlu 1840 1845 1850 agt ttt act tca gtt aag aaa cct aaa aga gat gac tccaag gac cta 5976 Ser Phe Thr Ser Val Lys Lys Pro Lys Arg Asp Asp Ser LysAsp Leu 1855 1860 1865 1870 gct ctt tgc agt atg att ctg act gaa atg gaaact cat gag gat gca 6024 Ala Leu Cys Ser Met Ile Leu Thr Glu Met Glu ThrHis Glu Asp Ala 1875 1880 1885 tgg cct ttt cta ctt cct gta aac ttg aaactt gtt cct ggt tat aag 6072 Trp Pro Phe Leu Leu Pro Val Asn Leu Lys LeuVal Pro Gly Tyr Lys 1890 1895 1900 aaa gtt att aag aag cct atg gat ttttcc aca att aga gag aaa cta 6120 Lys Val Ile Lys Lys Pro Met Asp Phe SerThr Ile Arg Glu Lys Leu 1905 1910 1915 agt agt gga cag tat cca aac cttgaa acc ttt gct cta gat gtc agg 6168 Ser Ser Gly Gln Tyr Pro Asn Leu GluThr Phe Ala Leu Asp Val Arg 1920 1925 1930 ctt gtt ttt gac aac tgt gaaaca ttt aat gaa gat gat tct gat ata 6216 Leu Val Phe Asp Asn Cys Glu ThrPhe Asn Glu Asp Asp Ser Asp Ile 1935 1940 1945 1950 ggc aga gct ggc cacaat atg agg aag tat ttt gaa aaa aag tgg aca 6264 Gly Arg Ala Gly His AsnMet Arg Lys Tyr Phe Glu Lys Lys Trp Thr 1955 1960 1965 gat act ttc aaagtg agc tgaagttata ataatctctt tatttttttc 6312 Asp Thr Phe Lys Val Ser1970 cttctaaaca aggacaaatg agaccagcaa tgtgaactgt atttacataa acgtgcaagg6372 cacatacata atgactttct ttttccttaa gtataaaaaa aaagtatcag aagaatgata6432 ccatttttaa aggcttcatt cctacaacaa ccaaggccct cggttattgg tttgtgtgat6492 ttatcagcta atttaggtag aacagggaag cacacccaaa gaattttcaa aggaaagggt6552 gttatagtgc aatagcaatt aaaatatatc aaatcgcact gaatactcaa caccagagct6612 ctaatgtggg aaatggttct cctttccctc tcaataaata tctatttttc atttttttac6672 tttgtagttt attttttagt gaatgtattt aattttatga attatttatg attaaaccac6732 atccagaatc ttcgttttct gtgaaaagga agaactagaa aattgcttta aatcttgaaa6792 atacaaggaa tgttttaaaa tataaaacaa agccaagtta aactgtttac actgatgtgc6852 tataaaagca ccaaaaagaa actttactgt agagttacaa gtacatttat atatatatgt6912 tgctgcatca cttgtgtagt taaattgtat ttcaaaacag tgaaaaaatt gacatgtata6972 tactgttcat tcttgtttat attaagtctt gttttaaata tgtattatgt gtatatattg7032 tttgcagaca ttattgttca tgccttagag gattgtagca ttttattttc gtctgaaggt7092 aatgatagct atacagtctg tacagtaatt atcctctacc aacactgtgg cgtctcctta7152 atcttggtag tgcctgcctt tgaaacaggg tgtaggggat attagttttc catttttcta7212 ttttgttata taattttaag ccaccagggc ctaaattaaa gtataatcat ttgtatccat7272 gtggaataaa attgtgacaa tttcctacgc acacagtatt ttttcataga aacatttccc7332 tcccatttgc cttgcctcag aaataaattt aaaagacgtt tgtaaccact gtgttttatc7392 tactgtgtgt tgtggtggcc tgttggaggc aaatagatca gatttttttt gtacctacgt7452 aagagtactt gaagttttat ttaaaataaa atgttgtgga aaaggtagca ttcttttttt7512 aggagtgtta tttttcacta tgtgtggcac ggatacaata aaagactttt acaaactaaa7572 aaaaaaaaaa aaa 7585 <210> SEQ ID NO 23 <211> LENGTH: 25 <212> TYPE:DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Synthetically generated primer <400> SEQUENCE: 23ctgactgaaa tggaaactca tgagg 25 <210> SEQ ID NO 24 <211> LENGTH: 25 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Synthetically generated primer <400> SEQUENCE: 24ctagagcaaa ggtttcaagg tttgg 25 <210> SEQ ID NO 25 <211> LENGTH: 24 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Synthetically generated primer <400> SEQUENCE: 25tgttgctgca tcacttgtgt agtt 24 <210> SEQ ID NO 26 <211> LENGTH: 23 <212>TYPE: DNA <213> ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHERINFORMATION: Synthetically generated primer <400> SEQUENCE: 26ggcatgacaa taatgtctgc aaa 23 <210> SEQ ID NO 27 <211> LENGTH: 1527 <212>TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 27 Met Ala ProLeu Leu Gly Arg Lys Pro Phe Pro Leu Val Asn Pro Leu 1 5 10 15 Pro GlyGlu Glu Pro Phe Phe Thr Ile Pro His Thr Gln Glu Ala Phe 20 25 30 Arg ThrArg Glu Glu Tyr Glu Ala Arg Leu Glu Arg Tyr Ser Glu Arg 35 40 45 Ile TrpThr Cys Lys Ser Thr Gly Ser Ser Gln Leu Thr His Lys Glu 50 55 60 Ala TrpGlu Glu Glu Gln Glu Val Ala Glu Leu Leu Lys Glu Glu Phe 65 70 75 80 ProAla Trp Tyr Glu Lys Leu Val Leu Glu Met Val His His Asn Thr 85 90 95 AlaSer Leu Glu Lys Leu Val Asp Thr Ala Trp Leu Glu Ile Met Thr 100 105 110Lys Tyr Ala Val Gly Glu Glu Cys Asp Phe Glu Val Gly Lys Glu Lys 115 120125 Met Leu Lys Val Lys Ile Val Lys Ile His Pro Leu Glu Lys Val Asp 130135 140 Glu Glu Ala Thr Glu Lys Lys Ser Asp Gly Ala Cys Asp Ser Pro Ser145 150 155 160 Ser Asp Lys Glu Asn Ser Ser Gln Ile Ala Gln Asp His GlnLys Lys 165 170 175 Glu Thr Val Val Lys Glu Asp Glu Gly Arg Arg Glu SerIle Asn Asp 180 185 190 Arg Ala Arg Arg Ser Pro Arg Lys Leu Pro Thr SerLeu Lys Lys Gly 195 200 205 Glu Arg Lys Trp Ala Pro Pro Lys Phe Leu ProHis Lys Tyr Asp Val 210 215 220 Lys Leu Gln Asn Glu Asp Lys Ile Ile SerAsn Val Pro Ala Asp Ser 225 230 235 240 Leu Ile Arg Thr Glu Arg Pro ProAsn Lys Glu Ile Val Arg Tyr Phe 245 250 255 Ile Arg His Asn Ala Leu ArgAla Gly Thr Gly Glu Asn Ala Pro Trp 260 265 270 Val Val Glu Asp Glu LeuVal Lys Lys Tyr Ser Leu Pro Ser Lys Phe 275 280 285 Ser Asp Phe Leu LeuAsp Pro Tyr Lys Tyr Met Thr Leu Asn Pro Ser 290 295 300 Thr Lys Arg LysAsn Thr Gly Ser Pro Asp Arg Lys Pro Ser Lys Lys 305 310 315 320 Ser LysThr Asp Asn Ser Ser Leu Ser Ser Pro Leu Asn Pro Lys Leu 325 330 335 TrpCys His Val His Leu Lys Lys Ser Leu Ser Gly Ser Pro Leu Lys 340 345 350Val Lys Asn Ser Lys Asn Ser Lys Ser Pro Glu Glu His Leu Glu Glu 355 360365 Met Met Lys Met Met Ser Pro Asn Lys Leu His Thr Asn Phe His Ile 370375 380 Pro Lys Lys Gly Pro Pro Ala Lys Lys Pro Gly Lys His Ser Asp Lys385 390 395 400 Pro Leu Lys Ala Lys Gly Arg Ser Lys Gly Ile Leu Asn GlyGln Lys 405 410 415 Ser Thr Gly Asn Ser Lys Ser Pro Lys Lys Gly Leu LysThr Pro Lys 420 425 430 Thr Lys Met Lys Gln Met Thr Leu Leu Asp Met AlaLys Gly Thr Gln 435 440 445 Lys Met Thr Arg Ala Pro Arg Asn Ser Gly GlyThr Pro Arg Thr Ser 450 455 460 Ser Lys Pro His Lys His Leu Pro Pro AlaAla Leu His Leu Ile Ala 465 470 475 480 Tyr Tyr Lys Glu Asn Lys Asp ArgGlu Asp Lys Arg Ser Ala Leu Ser 485 490 495 Cys Val Ile Ser Lys Thr AlaArg Leu Leu Ser Ser Glu Asp Arg Ala 500 505 510 Arg Leu Pro Glu Glu LeuArg Ser Leu Val Gln Lys Arg Tyr Glu Leu 515 520 525 Leu Glu His Lys LysArg Trp Ala Ser Met Ser Glu Glu Gln Arg Lys 530 535 540 Glu Tyr Leu LysLys Lys Arg Glu Glu Leu Lys Lys Lys Leu Lys Glu 545 550 555 560 Lys AlaLys Glu Arg Arg Glu Lys Glu Met Leu Glu Arg Leu Glu Lys 565 570 575 GlnLys Arg Tyr Glu Asp Gln Glu Leu Thr Gly Lys Asn Leu Pro Ala 580 585 590Phe Arg Leu Val Asp Thr Pro Glu Gly Leu Pro Asn Thr Leu Phe Gly 595 600605 Asp Val Ala Met Val Val Glu Phe Leu Ser Cys Tyr Ser Gly Leu Leu 610615 620 Leu Pro Asp Ala Gln Tyr Pro Ile Thr Ala Val Ser Leu Met Glu Ala625 630 635 640 Leu Ser Ala Asp Lys Gly Gly Phe Leu Tyr Leu Asn Arg ValLeu Val 645 650 655 Ile Leu Leu Gln Thr Leu Leu Gln Asp Glu Ile Ala GluAsp Tyr Gly 660 665 670 Glu Leu Gly Met Lys Leu Ser Glu Ile Pro Leu ThrLeu His Ser Val 675 680 685 Ser Glu Leu Val Arg Leu Cys Leu Arg Arg SerAsp Val Gln Glu Glu 690 695 700 Ser Glu Gly Ser Asp Thr Asp Asp Asn LysAsp Ser Ala Ala Phe Glu 705 710 715 720 Asp Asn Glu Val Gln Asp Glu PheLeu Glu Lys Leu Glu Thr Ser Glu 725 730 735 Phe Phe Glu Leu Thr Ser GluGlu Lys Leu Gln Ile Leu Thr Ala Leu 740 745 750 Cys His Arg Ile Leu MetThr Tyr Ser Val Gln Asp His Met Glu Thr 755 760 765 Arg Gln Gln Met SerAla Glu Leu Trp Lys Glu Arg Leu Ala Val Leu 770 775 780 Lys Glu Glu AsnAsp Lys Lys Arg Ala Glu Lys Gln Lys Arg Lys Glu 785 790 795 800 Met GluAla Lys Asn Lys Glu Asn Gly Lys Val Glu Asn Gly Leu Gly 805 810 815 LysThr Asp Arg Lys Lys Arg Ile Val Lys Phe Glu Pro Gln Val Asp 820 825 830Thr Glu Ala Glu Asp Met Ile Ser Ala Val Lys Ser Arg Arg Leu Leu 835 840845 Ala Ile Gln Ala Lys Lys Glu Arg Glu Ile Gln Glu Arg Glu Met Lys 850855 860 Val Lys Leu Glu Arg Gln Ala Glu Glu Glu Arg Ile Arg Lys His Lys865 870 875 880 Ala Ala Ala Glu Lys Ala Phe Gln Glu Gly Ile Ala Lys AlaLys Leu 885 890 895 Val Met Arg Arg Thr Pro Ile Gly Thr Asp Arg Asn HisAsn Arg Tyr 900 905 910 Trp Leu Phe Ser Asp Glu Val Pro Gly Leu Phe IleGlu Lys Gly Trp 915 920 925 Val His Asp Ser Ile Asp Tyr Arg Phe Asn HisHis Cys Lys Asp His 930 935 940 Thr Val Ser Gly Asp Glu Asp Tyr Cys ProArg Ser Lys Lys Ala Asn 945 950 955 960 Leu Gly Lys Asn Ala Ser Met AsnThr Gln His Gly Thr Ala Thr Glu 965 970 975 Val Ala Val Glu Thr Thr ThrPro Lys Gln Gly Gln Asn Leu Trp Phe 980 985 990 Leu Cys Asp Ser Gln LysGlu Leu Asp Glu Leu Leu Asn Cys Leu His 995 1000 1005 Pro Gln Gly IleArg Glu Ser Gln Leu Lys Glu Arg Leu Glu Lys Arg 1010 1015 1020 Tyr GlnAsp Ile Ile His Ser Ile His Leu Ala Arg Lys Pro Asn Leu 1025 1030 10351040 Gly Leu Lys Ser Cys Asp Gly Asn Gln Glu Leu Leu Asn Phe Leu Arg1045 1050 1055 Ser Asp Leu Ile Glu Val Ala Thr Arg Leu Gln Lys Gly GlyLeu Gly 1060 1065 1070 Tyr Val Glu Glu Thr Ser Glu Phe Glu Ala Arg ValIle Ser Leu Glu 1075 1080 1085 Lys Leu Lys Asp Phe Gly Glu Cys Val IleAla Leu Gln Ala Ser Val 1090 1095 1100 Ile Lys Lys Phe Leu Gln Gly PheMet Ala Pro Lys Gln Lys Arg Arg 1105 1110 1115 1120 Lys Leu Gln Ser GluAsp Ser Ala Lys Thr Glu Glu Val Asp Glu Glu 1125 1130 1135 Lys Lys MetVal Glu Glu Ala Lys Val Ala Ser Ala Leu Glu Lys Trp 1140 1145 1150 LysThr Ala Ile Arg Glu Ala Gln Thr Phe Ser Arg Met His Val Leu 1155 11601165 Leu Gly Met Leu Asp Ala Cys Ile Lys Trp Asp Met Ser Ala Glu Asn1170 1175 1180 Ala Arg Cys Lys Val Cys Pro Lys Lys Gly Glu Asp Asp LysLeu Ile 1185 1190 1195 1200 Leu Cys Asp Glu Cys Asn Lys Ala Phe His LeuPhe Cys Leu Arg Pro 1205 1210 1215 Ala Leu Tyr Glu Val Pro Asp Gly GluTrp Gln Cys Pro Ala Cys Gln 1220 1225 1230 Pro Ala Thr Ala Arg Arg AsnSer Arg Gly Arg Asn Tyr Thr Glu Glu 1235 1240 1245 Ser Ala Ser Glu AspSer Glu Asp Asp Glu Ser Asp Glu Glu Glu Glu 1250 1255 1260 Glu Glu GluGlu Glu Glu Glu Glu Glu Asp Tyr Glu Val Ala Gly Leu 1265 1270 1275 1280Arg Leu Arg Pro Arg Lys Thr Ile Arg Gly Lys His Ser Val Ile Pro 12851290 1295 Pro Ala Ala Arg Ser Gly Arg Arg Pro Gly Lys Lys Pro His SerThr 1300 1305 1310 Arg Arg Ser Gln Pro Lys Ala Pro Pro Val Asp Asp AlaGlu Val Asp 1315 1320 1325 Glu Leu Val Leu Gln Thr Lys Arg Ser Ser ArgArg Gln Ser Leu Glu 1330 1335 1340 Leu Gln Lys Cys Glu Glu Ile Leu HisLys Ile Val Lys Tyr Arg Phe 1345 1350 1355 1360 Ser Trp Pro Phe Arg GluPro Val Thr Arg Asp Glu Ala Glu Asp Tyr 1365 1370 1375 Tyr Asp Val IleThr His Pro Met Asp Phe Gln Thr Val Gln Asn Lys 1380 1385 1390 Cys SerCys Gly Ser Tyr Arg Ser Val Gln Glu Phe Leu Thr Asp Met 1395 1400 1405Lys Gln Val Phe Thr Asn Ala Glu Val Tyr Asn Cys Arg Gly Ser His 14101415 1420 Val Leu Ser Cys Met Val Lys Thr Glu Gln Cys Leu Val Val LeuLeu 1425 1430 1435 1440 His Lys His Leu Pro Gly His Pro Tyr Val Arg ArgLys Arg Lys Lys 1445 1450 1455 Phe Pro Asp Arg Leu Ala Glu Asp Glu GlyAsp Ser Glu Pro Glu Ala 1460 1465 1470 Val Gly Gln Ser Arg Asp Glu AspArg Arg Ser Arg Glu Ala Glu Ile 1475 1480 1485 Gln Glu Trp Leu Gln AspThr Ser Leu Tyr Ser Ala Lys Ile Asn Ser 1490 1495 1500 Lys Asp His AsnCys Phe Met Met Leu Val Asn Thr Gln Phe Cys Met 1505 1510 1515 1520 AlaLeu Thr Asp Thr Val Thr 1525 <210> SEQ ID NO 28 <211> LENGTH: 5561 <212>TYPE: DNA <213> ORGANISM: Homo sapiens <220> FEATURE: <221> NAME/KEY:CDS <222> LOCATION: (346)...(4926) <400> SEQUENCE: 28 cgggcccgggggaggagggg aatctcccgc catttttcaa taatttcctc cggtgctgct 60 gaggaggagtcgtgactgcc ggccgccggg acccgaagcg gaggtcggcg gggggctgct 120 gggaggcgcggcggtgtgcg cgggagctct gcgccgtggc gttccgctcc atgactgtcg 180 cgcggccgcgccggcggtga gggagccgga gttcgcgccg ccctctcacc cctcccttcc 240 cccaccccacccccgggcgc ctggcgctcg ctccgggccg cggggcctag tgctgcgccg 300 cggggccggccccagcagcc gccagtcccc accgccgccg ccgcg atg gcg ccg ctc 357 Met Ala ProLeu 1 ctg ggc cgc aag ccc ttc ccg ctg gtg aat ccg ttg ccc gga gag gag405 Leu Gly Arg Lys Pro Phe Pro Leu Val Asn Pro Leu Pro Gly Glu Glu 5 1015 20 ccg ttc ttc acc atc ccg cac act cag gag gcc ttc cgc acc cgg gaa453 Pro Phe Phe Thr Ile Pro His Thr Gln Glu Ala Phe Arg Thr Arg Glu 2530 35 gag tat gaa gcc cgc ttg gaa agg tac agt gag cgc att tgg acg tgc501 Glu Tyr Glu Ala Arg Leu Glu Arg Tyr Ser Glu Arg Ile Trp Thr Cys 4045 50 aag agt act gga agc agt cag cta aca cac aag gaa gcc tgg gag gaa549 Lys Ser Thr Gly Ser Ser Gln Leu Thr His Lys Glu Ala Trp Glu Glu 5560 65 gaa cag gaa gtt gct gag ctt ttg aag gag gag ttt cct gcc tgg tat597 Glu Gln Glu Val Ala Glu Leu Leu Lys Glu Glu Phe Pro Ala Trp Tyr 7075 80 gag aag ctt gtt ctg gaa atg gtt cac cat aac aca gcc tcc tta gag645 Glu Lys Leu Val Leu Glu Met Val His His Asn Thr Ala Ser Leu Glu 8590 95 100 aag tta gta gat act gct tgg ttg gag atc atg acc aaa tat gctgtg 693 Lys Leu Val Asp Thr Ala Trp Leu Glu Ile Met Thr Lys Tyr Ala Val105 110 115 gga gaa gag tgt gac ttc gag gtt ggg aag gag aaa atg ctc aaggtg 741 Gly Glu Glu Cys Asp Phe Glu Val Gly Lys Glu Lys Met Leu Lys Val120 125 130 aag att gtg aag att cat cct ttg gag aaa gtg gat gaa gag gccact 789 Lys Ile Val Lys Ile His Pro Leu Glu Lys Val Asp Glu Glu Ala Thr135 140 145 gag aag aaa tct gat ggt gcc tgt gat tct cca tca agt gac aaagag 837 Glu Lys Lys Ser Asp Gly Ala Cys Asp Ser Pro Ser Ser Asp Lys Glu150 155 160 aac tcc agt cag att gct cag gac cat cag aag aag gag aca gttgtg 885 Asn Ser Ser Gln Ile Ala Gln Asp His Gln Lys Lys Glu Thr Val Val165 170 175 180 aaa gag gat gaa gga agg aga gag agt att aat gac aga gcacgt aga 933 Lys Glu Asp Glu Gly Arg Arg Glu Ser Ile Asn Asp Arg Ala ArgArg 185 190 195 tcg cca cga aaa ctt cct act tca tta aaa aaa gga gaa aggaaa tgg 981 Ser Pro Arg Lys Leu Pro Thr Ser Leu Lys Lys Gly Glu Arg LysTrp 200 205 210 gct cct cca aaa ttt ctg cct cac aaa tat gat gtg aaa ctacaa aat 1029 Ala Pro Pro Lys Phe Leu Pro His Lys Tyr Asp Val Lys Leu GlnAsn 215 220 225 gaa gat aag atc atc agt aac gtg cca gca gac agc ttg attcgt aca 1077 Glu Asp Lys Ile Ile Ser Asn Val Pro Ala Asp Ser Leu Ile ArgThr 230 235 240 gag cgc cca cca aat aag gag ata gtt cga tac ttt ata cggcat aat 1125 Glu Arg Pro Pro Asn Lys Glu Ile Val Arg Tyr Phe Ile Arg HisAsn 245 250 255 260 gca tta cga gct ggt act ggt gaa aat gca cct tgg gtcgta gaa gat 1173 Ala Leu Arg Ala Gly Thr Gly Glu Asn Ala Pro Trp Val ValGlu Asp 265 270 275 gaa ttg gtg aag aaa tac tct ctg ccc agc aag ttc agtgac ttt tta 1221 Glu Leu Val Lys Lys Tyr Ser Leu Pro Ser Lys Phe Ser AspPhe Leu 280 285 290 ctt gat cca tac aag tat atg act ctc aac cct tct actaag agg aag 1269 Leu Asp Pro Tyr Lys Tyr Met Thr Leu Asn Pro Ser Thr LysArg Lys 295 300 305 aat act gga tcc cca gac agg aag ccc tca aag aaa tccaag aca gac 1317 Asn Thr Gly Ser Pro Asp Arg Lys Pro Ser Lys Lys Ser LysThr Asp 310 315 320 aac tct tct ctt agt tca cca cta aat cct aag tta tggtgt cac gta 1365 Asn Ser Ser Leu Ser Ser Pro Leu Asn Pro Lys Leu Trp CysHis Val 325 330 335 340 cac ttg aag aag tca ttg agt ggc tcg cca ctc aaagtg aag aac tca 1413 His Leu Lys Lys Ser Leu Ser Gly Ser Pro Leu Lys ValLys Asn Ser 345 350 355 aag aat tcc aaa tct cct gaa gaa cat cta gaa gaaatg atg aag atg 1461 Lys Asn Ser Lys Ser Pro Glu Glu His Leu Glu Glu MetMet Lys Met 360 365 370 atg tcg ccc aat aag ttg cac act aac ttt cac attcct aaa aaa ggc 1509 Met Ser Pro Asn Lys Leu His Thr Asn Phe His Ile ProLys Lys Gly 375 380 385 cca cct gcc aag aaa cca ggg aag cac agt gac aagcct ttg aag gca 1557 Pro Pro Ala Lys Lys Pro Gly Lys His Ser Asp Lys ProLeu Lys Ala 390 395 400 aag ggc aga agc aaa ggc atc ctg aat gga cag aaatcc aca ggg aat 1605 Lys Gly Arg Ser Lys Gly Ile Leu Asn Gly Gln Lys SerThr Gly Asn 405 410 415 420 tcc aaa tct ccc aaa aaa gga ctg aag act cctaaa acc aaa atg aag 1653 Ser Lys Ser Pro Lys Lys Gly Leu Lys Thr Pro LysThr Lys Met Lys 425 430 435 cag atg act ttg ttg gat atg gcc aaa ggc acgcag aag atg aca cga 1701 Gln Met Thr Leu Leu Asp Met Ala Lys Gly Thr GlnLys Met Thr Arg 440 445 450 gcc cca cgg aat tct ggg ggt aca cct agg acctct agt aaa cct cat 1749 Ala Pro Arg Asn Ser Gly Gly Thr Pro Arg Thr SerSer Lys Pro His 455 460 465 aaa cat ctg cct cct gca gcc cta cac ctc attgca tac tac aaa gaa 1797 Lys His Leu Pro Pro Ala Ala Leu His Leu Ile AlaTyr Tyr Lys Glu 470 475 480 aac aaa gac agg gag gac aag agg agc gcc ctgtcc tgt gtt atc tcc 1845 Asn Lys Asp Arg Glu Asp Lys Arg Ser Ala Leu SerCys Val Ile Ser 485 490 495 500 aaa aca gct cgt ctt ctc tct agt gaa gataga gct cgt ctc cca gaa 1893 Lys Thr Ala Arg Leu Leu Ser Ser Glu Asp ArgAla Arg Leu Pro Glu 505 510 515 gaa ttg cga agt ctt gtt caa aaa cgc tatgaa ctt cta gag cac aaa 1941 Glu Leu Arg Ser Leu Val Gln Lys Arg Tyr GluLeu Leu Glu His Lys 520 525 530 aag agg tgg gct tct atg tct gaa gaa caacgg aaa gaa tat ttg aaa 1989 Lys Arg Trp Ala Ser Met Ser Glu Glu Gln ArgLys Glu Tyr Leu Lys 535 540 545 aag aaa cgg gag gag ctg aaa aag aag ttgaag gaa aaa gcc aaa gaa 2037 Lys Lys Arg Glu Glu Leu Lys Lys Lys Leu LysGlu Lys Ala Lys Glu 550 555 560 cga aga gag aaa gaa atg ctt gag aga ttagaa aaa cag aag cgg tat 2085 Arg Arg Glu Lys Glu Met Leu Glu Arg Leu GluLys Gln Lys Arg Tyr 565 570 575 580 gag gac caa gag tta act ggc aaa aacctt cca gca ttc aga ttg gtg 2133 Glu Asp Gln Glu Leu Thr Gly Lys Asn LeuPro Ala Phe Arg Leu Val 585 590 595 gat acc cct gaa ggg ctg ccc aac acgctg ttt ggg gat gtg gcc atg 2181 Asp Thr Pro Glu Gly Leu Pro Asn Thr LeuPhe Gly Asp Val Ala Met 600 605 610 gtg gtg gaa ttc ttg agc tgt tat tctggg cta ctt tta cca gat gct 2229 Val Val Glu Phe Leu Ser Cys Tyr Ser GlyLeu Leu Leu Pro Asp Ala 615 620 625 cag tat cct att act gct gtg tcc cttatg gaa gcc ttg agt gca gat 2277 Gln Tyr Pro Ile Thr Ala Val Ser Leu MetGlu Ala Leu Ser Ala Asp 630 635 640 aag ggt ggc ttt tta tac ctt aac agggtg ttg gtc atc ctc tta cag 2325 Lys Gly Gly Phe Leu Tyr Leu Asn Arg ValLeu Val Ile Leu Leu Gln 645 650 655 660 acc ctc cta caa gat gag ata gccgaa gac tat ggt gaa ttg gga atg 2373 Thr Leu Leu Gln Asp Glu Ile Ala GluAsp Tyr Gly Glu Leu Gly Met 665 670 675 aag ctg tca gaa atc ccc ttg actctg cat tct gtt tca gag ctg gtg 2421 Lys Leu Ser Glu Ile Pro Leu Thr LeuHis Ser Val Ser Glu Leu Val 680 685 690 cgg ctc tgc ttg cgc aga tct gatgtt cag gag gaa agc gag ggc tca 2469 Arg Leu Cys Leu Arg Arg Ser Asp ValGln Glu Glu Ser Glu Gly Ser 695 700 705 gac aca gat gac aat aaa gat tcagct gca ttt gag gat aat gag gta 2517 Asp Thr Asp Asp Asn Lys Asp Ser AlaAla Phe Glu Asp Asn Glu Val 710 715 720 caa gat gag ttc cta gaa aag ctggag acc tct gaa ttt ttt gag ctg 2565 Gln Asp Glu Phe Leu Glu Lys Leu GluThr Ser Glu Phe Phe Glu Leu 725 730 735 740 acg tca gag gag aag cta cagatt ttg aca gca ctg tgc cac cgg atc 2613 Thr Ser Glu Glu Lys Leu Gln IleLeu Thr Ala Leu Cys His Arg Ile 745 750 755 ctc atg aca tac tca gtg caagac cac atg gag acc aga cag cag atg 2661 Leu Met Thr Tyr Ser Val Gln AspHis Met Glu Thr Arg Gln Gln Met 760 765 770 tct gca gag ttg tgg aag gaacgg ctt gct gtg ttg aag gaa gaa aat 2709 Ser Ala Glu Leu Trp Lys Glu ArgLeu Ala Val Leu Lys Glu Glu Asn 775 780 785 gat aag aag aga gca gag aaacag aaa cgg aaa gaa atg gaa gcc aaa 2757 Asp Lys Lys Arg Ala Glu Lys GlnLys Arg Lys Glu Met Glu Ala Lys 790 795 800 aat aaa gaa aat gga aaa gttgag aat ggg tta ggc aaa act gat agg 2805 Asn Lys Glu Asn Gly Lys Val GluAsn Gly Leu Gly Lys Thr Asp Arg 805 810 815 820 aaa aaa aga att gtg aagttt gag ccc caa gta gat aca gaa gct gaa 2853 Lys Lys Arg Ile Val Lys PheGlu Pro Gln Val Asp Thr Glu Ala Glu 825 830 835 gac atg att agt gct gtgaag agc aga agg ttg ctt gcc att caa gct 2901 Asp Met Ile Ser Ala Val LysSer Arg Arg Leu Leu Ala Ile Gln Ala 840 845 850 aag aag gaa cgg gaa atccag gaa aga gaa atg aaa gtg aaa ctg gaa 2949 Lys Lys Glu Arg Glu Ile GlnGlu Arg Glu Met Lys Val Lys Leu Glu 855 860 865 cgc caa gct gaa gaa gaacga ata cgg aag cac aaa gca gct gct gag 2997 Arg Gln Ala Glu Glu Glu ArgIle Arg Lys His Lys Ala Ala Ala Glu 870 875 880 aaa gct ttc cag gaa gggatt gcc aag gcc aaa cta gtc atg cgc agg 3045 Lys Ala Phe Gln Glu Gly IleAla Lys Ala Lys Leu Val Met Arg Arg 885 890 895 900 act cct att ggc acagat cga aac cat aat aga tac tgg ctc ttc tca 3093 Thr Pro Ile Gly Thr AspArg Asn His Asn Arg Tyr Trp Leu Phe Ser 905 910 915 gat gaa gtt cca ggatta ttc att gaa aaa ggc tgg gta cat gac agc 3141 Asp Glu Val Pro Gly LeuPhe Ile Glu Lys Gly Trp Val His Asp Ser 920 925 930 att gac tac cga ttcaac cat cac tgc aaa gac cac aca gtc tct ggt 3189 Ile Asp Tyr Arg Phe AsnHis His Cys Lys Asp His Thr Val Ser Gly 935 940 945 gat gag gat tac tgtcct cgc agt aag aaa gca aac tta ggt aaa aat 3237 Asp Glu Asp Tyr Cys ProArg Ser Lys Lys Ala Asn Leu Gly Lys Asn 950 955 960 gca agc atg aac acacaa cat gga aca gca aca gaa gtt gct gta gag 3285 Ala Ser Met Asn Thr GlnHis Gly Thr Ala Thr Glu Val Ala Val Glu 965 970 975 980 aca acc aca cccaaa caa gga cag aac cta tgg ttt tta tgt gat agt 3333 Thr Thr Thr Pro LysGln Gly Gln Asn Leu Trp Phe Leu Cys Asp Ser 985 990 995 caa aag gag ctggat gag ttg cta aac tgt ctt cac cct cag gga ata 3381 Gln Lys Glu Leu AspGlu Leu Leu Asn Cys Leu His Pro Gln Gly Ile 1000 1005 1010 aga gaa agtcaa ctt aaa gag aga cta gag aag agg tac cag gac att 3429 Arg Glu Ser GlnLeu Lys Glu Arg Leu Glu Lys Arg Tyr Gln Asp Ile 1015 1020 1025 att cactct att cat cta gca cgg aag cca aat ttg ggt cta aaa tct 3477 Ile His SerIle His Leu Ala Arg Lys Pro Asn Leu Gly Leu Lys Ser 1030 1035 1040 tgtgat ggc aac cag gag ctt tta aac ttc ctt cgt agt gat ctc att 3525 Cys AspGly Asn Gln Glu Leu Leu Asn Phe Leu Arg Ser Asp Leu Ile 1045 1050 10551060 gaa gtt gca aca agg tta caa aaa gga gga ctt gga tat gtg gaa gaa3573 Glu Val Ala Thr Arg Leu Gln Lys Gly Gly Leu Gly Tyr Val Glu Glu1065 1070 1075 aca tca gaa ttt gaa gcc cgg gtc att tca tta gag aaa ttgaag gat 3621 Thr Ser Glu Phe Glu Ala Arg Val Ile Ser Leu Glu Lys Leu LysAsp 1080 1085 1090 ttt ggt gag tgt gtg att gcc ctt cag gcc agt gtc ataaag aaa ttt 3669 Phe Gly Glu Cys Val Ile Ala Leu Gln Ala Ser Val Ile LysLys Phe 1095 1100 1105 ctc caa ggc ttc atg gct ccc aag caa aag aga agaaaa ctc caa agt 3717 Leu Gln Gly Phe Met Ala Pro Lys Gln Lys Arg Arg LysLeu Gln Ser 1110 1115 1120 gaa gat tca gca aaa act gag gaa gtg gat gaagag aag aaa atg gta 3765 Glu Asp Ser Ala Lys Thr Glu Glu Val Asp Glu GluLys Lys Met Val 1125 1130 1135 1140 gag gaa gca aag gtt gca tct gca ctggag aaa tgg aag aca gca atc 3813 Glu Glu Ala Lys Val Ala Ser Ala Leu GluLys Trp Lys Thr Ala Ile 1145 1150 1155 cgg gaa gct cag act ttc tcc aggatg cac gtg ctg ctt ggg atg ctt 3861 Arg Glu Ala Gln Thr Phe Ser Arg MetHis Val Leu Leu Gly Met Leu 1160 1165 1170 gat gcc tgt atc aag tgg gatatg tcc gca gaa aat gct agg tgc aaa 3909 Asp Ala Cys Ile Lys Trp Asp MetSer Ala Glu Asn Ala Arg Cys Lys 1175 1180 1185 gtt tgt cca aag aaa ggtgag gat gac aaa ttg atc ttg tgt gat gag 3957 Val Cys Pro Lys Lys Gly GluAsp Asp Lys Leu Ile Leu Cys Asp Glu 1190 1195 1200 tgt aat aaa gcc ttccac ctg ttt tgt ctg agg ccg gcc ctc tat gaa 4005 Cys Asn Lys Ala Phe HisLeu Phe Cys Leu Arg Pro Ala Leu Tyr Glu 1205 1210 1215 1220 gta cca gatggt gag tgg cag tgc cca gct tgc cag ccc gct act gcc 4053 Val Pro Asp GlyGlu Trp Gln Cys Pro Ala Cys Gln Pro Ala Thr Ala 1225 1230 1235 agg cgcaac tcc cgt ggc agg aac tat act gaa gag tct gct tct gag 4101 Arg Arg AsnSer Arg Gly Arg Asn Tyr Thr Glu Glu Ser Ala Ser Glu 1240 1245 1250 gacagt gaa gat gat gag agt gat gaa gag gag gag gag gaa gaa gag 4149 Asp SerGlu Asp Asp Glu Ser Asp Glu Glu Glu Glu Glu Glu Glu Glu 1255 1260 1265gag gag gag gaa gaa gat tat gag gtg gct ggt ttg cga ttg aga cct 4197 GluGlu Glu Glu Glu Asp Tyr Glu Val Ala Gly Leu Arg Leu Arg Pro 1270 12751280 cga aag acc atc cgg ggc aag cac agc gtc atc ccc cct gca gca agg4245 Arg Lys Thr Ile Arg Gly Lys His Ser Val Ile Pro Pro Ala Ala Arg1285 1290 1295 1300 tca ggc cgg cgc ccg ggt aag aag cca cac tct acc aggagg tct cag 4293 Ser Gly Arg Arg Pro Gly Lys Lys Pro His Ser Thr Arg ArgSer Gln 1305 1310 1315 ccc aag gca cca cct gtg gat gat gct gag gtg gatgag ctg gtg ctt 4341 Pro Lys Ala Pro Pro Val Asp Asp Ala Glu Val Asp GluLeu Val Leu 1320 1325 1330 cag acc aag cgg agc tcc cgg agg caa agc ctggag ctg cag aag tgt 4389 Gln Thr Lys Arg Ser Ser Arg Arg Gln Ser Leu GluLeu Gln Lys Cys 1335 1340 1345 gaa gag atc ctc cac aag atc gtg aag taccgc ttc agc tgg ccc ttc 4437 Glu Glu Ile Leu His Lys Ile Val Lys Tyr ArgPhe Ser Trp Pro Phe 1350 1355 1360 agg gag cct gtg acc aga gat gag gccgag gac tac tat gat gtg atc 4485 Arg Glu Pro Val Thr Arg Asp Glu Ala GluAsp Tyr Tyr Asp Val Ile 1365 1370 1375 1380 acg cac ccc atg gac ttt cagaca gtg cag aac aaa tgt tcc tgt ggg 4533 Thr His Pro Met Asp Phe Gln ThrVal Gln Asn Lys Cys Ser Cys Gly 1385 1390 1395 agc tac cgc tct gtg caggag ttt ctt act gac atg aag caa gtg ttt 4581 Ser Tyr Arg Ser Val Gln GluPhe Leu Thr Asp Met Lys Gln Val Phe 1400 1405 1410 acc aat gct gag gtttac aac tgc cgt ggc agc cat gtg cta agc tgc 4629 Thr Asn Ala Glu Val TyrAsn Cys Arg Gly Ser His Val Leu Ser Cys 1415 1420 1425 atg gtg aag acagaa cag tgt cta gtg gtt ctg ttg cat aaa cac ctt 4677 Met Val Lys Thr GluGln Cys Leu Val Val Leu Leu His Lys His Leu 1430 1435 1440 cct ggc caccca tat gtc cgc agg aag cgc aag aag ttt cct gat agg 4725 Pro Gly His ProTyr Val Arg Arg Lys Arg Lys Lys Phe Pro Asp Arg 1445 1450 1455 1460 cttgct gaa gat gaa ggg gac agt gag cca gag gcc gtt gga cag tcc 4773 Leu AlaGlu Asp Glu Gly Asp Ser Glu Pro Glu Ala Val Gly Gln Ser 1465 1470 1475agg gac gaa gac aga aga agt aga gag gcg gag att cag gaa tgg ctc 4821 ArgAsp Glu Asp Arg Arg Ser Arg Glu Ala Glu Ile Gln Glu Trp Leu 1480 14851490 cag gac acg tcc ctt tac tct gcc aag atc aac tca aaa gac cac aac4869 Gln Asp Thr Ser Leu Tyr Ser Ala Lys Ile Asn Ser Lys Asp His Asn1495 1500 1505 tgt ttc atg atg ctg gtg aat aca caa ttc tgt atg gca ctcact gat 4917 Cys Phe Met Met Leu Val Asn Thr Gln Phe Cys Met Ala Leu ThrAsp 1510 1515 1520 act gtc acc tgagaggaag acgggggaag agacagagtatgggcttaaa 4966 Thr Val Thr 1525 gaaacaagac tgtataataa atacagattaaaaaagaaaa atcgccacca tctcccctgt 5026 tggcctgatt accccgatcc tgctatgtaacacagcaatc cctcccctgg agaccagagg 5086 ggcttggcac tgtggtggaa gccagaacgagcaggccctt aggaaagaag gcaggaacag 5146 gaactggctt caccagaaaa gctagaccctcggactcctc ctggaaactc tcagaaggga 5206 gggttatggc cctctttgtc ccttcatatttctggacaaa gaccaccaac ccaatatcaa 5266 gccccataaa gagcttttag aaaaacagcataagcttggg atgacaggcg tttctggact 5326 ccctgtgatc tcttccaggt tcttggtcttcctcgctcgc ctccctccca ccctccctag 5386 ctgtcccccc acctcagctc ccttaccacggccctgcctc tctacttctt ctgtcttcgt 5446 cccctggact gtccaacggc ctctggctcactgtcccttc atcttcagca acctatcagg 5506 aaacttcttg cgcttcctgc ggacatatgggtggccagga agtgtttatg caaca 5561 <210> SEQ ID NO 29 <211> LENGTH: 1531<212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 29 Met AlaPro Leu Leu Gly Arg Lys Pro Phe Pro Leu Val Asn Pro Leu 1 5 10 15 ProGly Glu Glu Pro Phe Phe Thr Ile Pro His Thr Gln Glu Ala Phe 20 25 30 ArgThr Arg Glu Glu Tyr Glu Ala Arg Leu Glu Arg Tyr Ser Glu Arg 35 40 45 IleTrp Thr Cys Lys Ser Thr Gly Ser Ser Gln Leu Thr His Lys Glu 50 55 60 AlaTrp Glu Glu Glu Gln Glu Val Ala Glu Leu Leu Lys Glu Glu Phe 65 70 75 80Pro Ala Trp Tyr Glu Lys Leu Val Leu Glu Met Val His His Asn Thr 85 90 95Ala Ser Leu Glu Lys Leu Val Asp Thr Ala Trp Leu Glu Ile Met Thr 100 105110 Lys Tyr Ala Val Gly Glu Glu Cys Asp Phe Glu Val Gly Lys Glu Lys 115120 125 Met Leu Lys Val Lys Ile Val Lys Ile His Pro Leu Glu Lys Val Asp130 135 140 Glu Glu Ala Thr Glu Lys Lys Ser Asp Gly Ala Cys Asp Ser ProSer 145 150 155 160 Ser Asp Lys Glu Asn Ser Ser Gln Ile Ala Gln Asp HisGln Lys Lys 165 170 175 Glu Thr Val Val Lys Glu Asp Glu Gly Arg Arg GluSer Ile Asn Asp 180 185 190 Arg Ala Arg Arg Ser Pro Arg Lys Leu Pro ThrSer Leu Lys Lys Gly 195 200 205 Glu Arg Lys Trp Ala Pro Pro Lys Phe LeuPro His Lys Tyr Asp Val 210 215 220 Lys Leu Gln Asn Glu Asp Lys Ile IleSer Asn Val Pro Ala Asp Ser 225 230 235 240 Leu Ile Arg Thr Glu Arg ProPro Asn Lys Glu Ile Val Arg Tyr Phe 245 250 255 Ile Arg His Asn Ala LeuArg Ala Gly Thr Gly Glu Asn Ala Pro Trp 260 265 270 Val Val Glu Asp GluLeu Val Lys Lys Tyr Ser Leu Pro Ser Lys Phe 275 280 285 Ser Asp Phe LeuLeu Asp Pro Tyr Lys Tyr Met Thr Leu Asn Pro Ser 290 295 300 Thr Lys ArgLys Asn Thr Gly Ser Pro Asp Arg Lys Pro Ser Lys Lys 305 310 315 320 SerLys Thr Asp Asn Ser Ser Leu Ser Ser Pro Leu Asn Pro Lys Leu 325 330 335Trp Cys His Val His Leu Lys Lys Ser Leu Ser Gly Ser Pro Leu Lys 340 345350 Val Lys Asn Ser Lys Asn Ser Lys Ser Pro Glu Glu His Leu Glu Glu 355360 365 Met Met Lys Met Met Ser Pro Asn Lys Leu His Thr Asn Phe His Ile370 375 380 Pro Lys Lys Gly Pro Pro Ala Lys Lys Pro Gly Lys His Ser AspLys 385 390 395 400 Pro Leu Lys Ala Lys Gly Arg Ser Lys Gly Ile Leu AsnGly Gln Lys 405 410 415 Ser Thr Gly Asn Ser Lys Ser Pro Lys Lys Gly LeuLys Thr Pro Lys 420 425 430 Thr Lys Met Lys Gln Met Thr Leu Leu Asp MetAla Lys Gly Thr Gln 435 440 445 Lys Met Thr Arg Ala Pro Arg Asn Ser GlyGly Thr Pro Arg Thr Ser 450 455 460 Ser Lys Pro His Lys His Leu Pro ProAla Ala Leu His Leu Ile Ala 465 470 475 480 Tyr Tyr Lys Glu Asn Lys AspArg Glu Asp Lys Arg Ser Ala Leu Ser 485 490 495 Cys Val Ile Ser Lys ThrAla Arg Leu Leu Ser Ser Glu Asp Arg Ala 500 505 510 Arg Leu Pro Glu GluLeu Arg Ser Leu Val Gln Lys Arg Tyr Glu Leu 515 520 525 Leu Glu His LysLys Arg Trp Ala Ser Met Ser Glu Glu Gln Arg Lys 530 535 540 Glu Tyr LeuLys Lys Lys Arg Glu Glu Leu Lys Lys Lys Leu Lys Glu 545 550 555 560 LysAla Lys Glu Arg Arg Glu Lys Glu Met Leu Glu Arg Leu Glu Lys 565 570 575Gln Lys Arg Tyr Glu Asp Gln Glu Leu Thr Gly Lys Asn Leu Pro Ala 580 585590 Phe Arg Leu Val Asp Thr Pro Glu Gly Leu Pro Asn Thr Leu Phe Gly 595600 605 Asp Val Ala Met Val Val Glu Phe Leu Ser Cys Tyr Ser Gly Leu Leu610 615 620 Leu Pro Asp Ala Gln Tyr Pro Ile Thr Ala Val Ser Leu Met GluAla 625 630 635 640 Leu Ser Ala Asp Lys Gly Gly Phe Leu Tyr Leu Asn ArgVal Leu Val 645 650 655 Ile Leu Leu Gln Thr Leu Leu Gln Thr Leu Leu GlnAsp Glu Ile Ala 660 665 670 Glu Asp Tyr Gly Glu Leu Gly Met Lys Leu SerGlu Ile Pro Leu Thr 675 680 685 Leu His Ser Val Ser Glu Leu Val Arg LeuCys Leu Arg Arg Ser Asp 690 695 700 Val Gln Glu Glu Ser Glu Gly Ser AspThr Asp Asp Asn Lys Asp Ser 705 710 715 720 Ala Ala Phe Glu Asp Asn GluVal Gln Asp Glu Phe Leu Glu Lys Leu 725 730 735 Glu Thr Ser Glu Phe PheGlu Leu Thr Ser Glu Glu Lys Leu Gln Ile 740 745 750 Leu Thr Ala Leu CysHis Arg Ile Leu Met Thr Tyr Ser Val Gln Asp 755 760 765 His Met Glu ThrArg Gln Gln Met Ser Ala Glu Leu Trp Lys Glu Arg 770 775 780 Leu Ala ValLeu Lys Glu Glu Asn Asp Lys Lys Arg Ala Glu Lys Gln 785 790 795 800 LysArg Lys Glu Met Glu Ala Lys Asn Lys Glu Asn Gly Lys Val Glu 805 810 815Asn Gly Leu Gly Lys Thr Asp Arg Lys Lys Arg Ile Val Lys Phe Glu 820 825830 Pro Gln Val Asp Thr Glu Ala Glu Asp Met Ile Ser Ala Val Lys Ser 835840 845 Arg Arg Leu Leu Ala Ile Gln Ala Lys Lys Glu Arg Glu Ile Gln Glu850 855 860 Arg Glu Met Lys Val Lys Leu Glu Arg Gln Ala Glu Glu Glu ArgIle 865 870 875 880 Arg Lys His Lys Ala Ala Ala Glu Lys Ala Phe Gln GluGly Ile Ala 885 890 895 Lys Ala Lys Leu Val Met Arg Arg Thr Pro Ile GlyThr Asp Arg Asn 900 905 910 His Asn Arg Tyr Trp Leu Phe Ser Asp Glu ValPro Gly Leu Phe Ile 915 920 925 Glu Lys Gly Trp Val His Asp Ser Ile AspTyr Arg Phe Asn His His 930 935 940 Cys Lys Asp His Thr Val Ser Gly AspGlu Asp Tyr Cys Pro Arg Ser 945 950 955 960 Lys Lys Ala Asn Leu Gly LysAsn Ala Ser Met Asn Thr Gln His Gly 965 970 975 Thr Ala Thr Glu Val AlaVal Glu Thr Thr Thr Pro Lys Gln Gly Gln 980 985 990 Asn Leu Trp Phe LeuCys Asp Ser Gln Lys Glu Leu Asp Glu Leu Leu 995 1000 1005 Asn Cys LeuHis Pro Gln Gly Ile Arg Glu Ser Gln Leu Lys Glu Arg 1010 1015 1020 LeuGlu Lys Arg Tyr Gln Asp Ile Ile His Ser Ile His Leu Ala Arg 1025 10301035 1040 Lys Pro Asn Leu Gly Leu Lys Ser Cys Asp Gly Asn Gln Glu LeuLeu 1045 1050 1055 Asn Phe Leu Arg Ser Asp Leu Ile Glu Val Ala Thr ArgLeu Gln Lys 1060 1065 1070 Gly Gly Leu Gly Tyr Val Glu Glu Thr Ser GluPhe Glu Ala Arg Val 1075 1080 1085 Ile Ser Leu Glu Lys Leu Lys Asp PheGly Glu Cys Val Ile Ala Leu 1090 1095 1100 Gln Ala Ser Val Ile Lys LysPhe Leu Gln Gly Phe Met Ala Pro Lys 1105 1110 1115 1120 Gln Lys Arg ArgLys Leu Gln Ser Glu Asp Ser Ala Lys Thr Glu Glu 1125 1130 1135 Val AspGlu Glu Lys Lys Met Val Glu Glu Ala Lys Val Ala Ser Ala 1140 1145 1150Leu Glu Lys Trp Lys Thr Ala Ile Arg Glu Ala Gln Thr Phe Ser Arg 11551160 1165 Met His Val Leu Leu Gly Met Leu Asp Ala Cys Ile Lys Trp AspMet 1170 1175 1180 Ser Ala Glu Asn Ala Arg Cys Lys Val Cys Pro Lys LysGly Glu Asp 1185 1190 1195 1200 Asp Lys Leu Ile Leu Cys Asp Glu Cys AsnLys Ala Phe His Leu Phe 1205 1210 1215 Cys Leu Arg Pro Ala Leu Tyr GluVal Pro Asp Gly Glu Trp Gln Cys 1220 1225 1230 Pro Ala Cys Gln Pro AlaThr Ala Arg Arg Asn Ser Arg Gly Arg Asn 1235 1240 1245 Tyr Thr Glu GluSer Ala Ser Glu Asp Ser Glu Asp Asp Glu Ser Asp 1250 1255 1260 Glu GluGlu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Asp Tyr Glu 1265 1270 12751280 Val Ala Gly Leu Arg Leu Arg Pro Arg Lys Thr Ile Arg Gly Lys His1285 1290 1295 Ser Val Ile Pro Pro Ala Ala Arg Ser Gly Arg Arg Pro GlyLys Lys 1300 1305 1310 Pro His Ser Thr Arg Arg Ser Gln Pro Lys Ala ProPro Val Asp Asp 1315 1320 1325 Ala Glu Val Asp Glu Leu Val Leu Gln ThrLys Arg Ser Ser Arg Arg 1330 1335 1340 Gln Ser Leu Glu Leu Gln Lys CysGlu Glu Ile Leu His Lys Ile Val 1345 1350 1355 1360 Lys Tyr Arg Phe SerTrp Pro Phe Arg Glu Pro Val Thr Arg Asp Glu 1365 1370 1375 Ala Glu AspTyr Tyr Asp Val Ile Thr His Pro Met Asp Phe Gln Thr 1380 1385 1390 ValGln Asn Lys Cys Ser Cys Gly Ser Tyr Arg Ser Val Gln Glu Phe 1395 14001405 Leu Thr Asp Met Lys Gln Val Phe Thr Asn Ala Glu Val Tyr Asn Cys1410 1415 1420 Arg Gly Ser His Val Leu Ser Cys Met Val Lys Thr Glu GlnCys Leu 1425 1430 1435 1440 Val Val Leu Leu His Lys His Leu Pro Gly HisPro Tyr Val Arg Arg 1445 1450 1455 Lys Arg Lys Lys Phe Pro Asp Arg LeuAla Glu Asp Glu Gly Asp Ser 1460 1465 1470 Glu Pro Glu Ala Val Gly GlnSer Arg Asp Glu Asp Arg Arg Ser Arg 1475 1480 1485 Glu Ala Glu Ile GlnGlu Trp Leu Gln Asp Thr Ser Leu Tyr Ser Ala 1490 1495 1500 Lys Ile AsnSer Lys Asp His Asn Cys Phe Met Met Leu Val Asn Thr 1505 1510 1515 1520Gln Phe Cys Met Ala Leu Thr Asp Thr Val Thr 1525 1530 <210> SEQ ID NO 30<211> LENGTH: 5573 <212> TYPE: DNA <213> ORGANISM: Homo sapiens <220>FEATURE: <221> NAME/KEY: CDS <222> LOCATION: (346)...(4938) <400>SEQUENCE: 30 cgggcccggg ggaggagggg aatctcccgc catttttcaa taatttcctccggtgctgct 60 gaggaggagt cgtgactgcc ggccgccggg acccgaagcg gaggtcggcggggggctgct 120 gggaggcgcg gcggtgtgcg cgggagctct gcgccgtggc gttccgctccatgactgtcg 180 cgcggccgcg ccggcggtga gggagccgga gttcgcgccg ccctctcacccctcccttcc 240 cccaccccac ccccgggcgc ctggcgctcg ctccgggccg cggggcctagtgctgcgccg 300 cggggccggc cccagcagcc gccagtcccc accgccgccg ccgcg atg gcgccg ctc 357 Met Ala Pro Leu 1 ctg ggc cgc aag ccc ttc ccg ctg gtg aatccg ttg ccc gga gag gag 405 Leu Gly Arg Lys Pro Phe Pro Leu Val Asn ProLeu Pro Gly Glu Glu 5 10 15 20 ccg ttc ttc acc atc ccg cac act cag gaggcc ttc cgc acc cgg gaa 453 Pro Phe Phe Thr Ile Pro His Thr Gln Glu AlaPhe Arg Thr Arg Glu 25 30 35 gag tat gaa gcc cgc ttg gaa agg tac agt gagcgc att tgg acg tgc 501 Glu Tyr Glu Ala Arg Leu Glu Arg Tyr Ser Glu ArgIle Trp Thr Cys 40 45 50 aag agt act gga agc agt cag cta aca cac aag gaagcc tgg gag gaa 549 Lys Ser Thr Gly Ser Ser Gln Leu Thr His Lys Glu AlaTrp Glu Glu 55 60 65 gaa cag gaa gtt gct gag ctt ttg aag gag gag ttt cctgcc tgg tat 597 Glu Gln Glu Val Ala Glu Leu Leu Lys Glu Glu Phe Pro AlaTrp Tyr 70 75 80 gag aag ctt gtt ctg gaa atg gtt cac cat aac aca gcc tcctta gag 645 Glu Lys Leu Val Leu Glu Met Val His His Asn Thr Ala Ser LeuGlu 85 90 95 100 aag tta gta gat act gct tgg ttg gag atc atg acc aaa tatgct gtg 693 Lys Leu Val Asp Thr Ala Trp Leu Glu Ile Met Thr Lys Tyr AlaVal 105 110 115 gga gaa gag tgt gac ttc gag gtt ggg aag gag aaa atg ctcaag gtg 741 Gly Glu Glu Cys Asp Phe Glu Val Gly Lys Glu Lys Met Leu LysVal 120 125 130 aag att gtg aag att cat cct ttg gag aaa gtg gat gaa gaggcc act 789 Lys Ile Val Lys Ile His Pro Leu Glu Lys Val Asp Glu Glu AlaThr 135 140 145 gag aag aaa tct gat ggt gcc tgt gat tct cca tca agt gacaaa gag 837 Glu Lys Lys Ser Asp Gly Ala Cys Asp Ser Pro Ser Ser Asp LysGlu 150 155 160 aac tcc agt cag att gct cag gac cat cag aag aag gag acagtt gtg 885 Asn Ser Ser Gln Ile Ala Gln Asp His Gln Lys Lys Glu Thr ValVal 165 170 175 180 aaa gag gat gaa gga agg aga gag agt att aat gac agagca cgt aga 933 Lys Glu Asp Glu Gly Arg Arg Glu Ser Ile Asn Asp Arg AlaArg Arg 185 190 195 tcg cca cga aaa ctt cct act tca tta aaa aaa gga gaaagg aaa tgg 981 Ser Pro Arg Lys Leu Pro Thr Ser Leu Lys Lys Gly Glu ArgLys Trp 200 205 210 gct cct cca aaa ttt ctg cct cac aaa tat gat gtg aaacta caa aat 1029 Ala Pro Pro Lys Phe Leu Pro His Lys Tyr Asp Val Lys LeuGln Asn 215 220 225 gaa gat aag atc atc agt aac gtg cca gca gac agc ttgatt cgt aca 1077 Glu Asp Lys Ile Ile Ser Asn Val Pro Ala Asp Ser Leu IleArg Thr 230 235 240 gag cgc cca cca aat aag gag ata gtt cga tac ttt atacgg cat aat 1125 Glu Arg Pro Pro Asn Lys Glu Ile Val Arg Tyr Phe Ile ArgHis Asn 245 250 255 260 gca tta cga gct ggt act ggt gaa aat gca cct tgggtc gta gaa gat 1173 Ala Leu Arg Ala Gly Thr Gly Glu Asn Ala Pro Trp ValVal Glu Asp 265 270 275 gaa ttg gtg aag aaa tac tct ctg ccc agc aag ttcagt gac ttt tta 1221 Glu Leu Val Lys Lys Tyr Ser Leu Pro Ser Lys Phe SerAsp Phe Leu 280 285 290 ctt gat cca tac aag tat atg act ctc aac cct tctact aag agg aag 1269 Leu Asp Pro Tyr Lys Tyr Met Thr Leu Asn Pro Ser ThrLys Arg Lys 295 300 305 aat act gga tcc cca gac agg aag ccc tca aag aaatcc aag aca gac 1317 Asn Thr Gly Ser Pro Asp Arg Lys Pro Ser Lys Lys SerLys Thr Asp 310 315 320 aac tct tct ctt agt tca cca cta aat cct aag ttatgg tgt cac gta 1365 Asn Ser Ser Leu Ser Ser Pro Leu Asn Pro Lys Leu TrpCys His Val 325 330 335 340 cac ttg aag aag tca ttg agt ggc tcg cca ctcaaa gtg aag aac tca 1413 His Leu Lys Lys Ser Leu Ser Gly Ser Pro Leu LysVal Lys Asn Ser 345 350 355 aag aat tcc aaa tct cct gaa gaa cat cta gaagaa atg atg aag atg 1461 Lys Asn Ser Lys Ser Pro Glu Glu His Leu Glu GluMet Met Lys Met 360 365 370 atg tcg ccc aat aag ttg cac act aac ttt cacatt cct aaa aaa ggc 1509 Met Ser Pro Asn Lys Leu His Thr Asn Phe His IlePro Lys Lys Gly 375 380 385 cca cct gcc aag aaa cca ggg aag cac agt gacaag cct ttg aag gca 1557 Pro Pro Ala Lys Lys Pro Gly Lys His Ser Asp LysPro Leu Lys Ala 390 395 400 aag ggc aga agc aaa ggc atc ctg aat gga cagaaa tcc aca ggg aat 1605 Lys Gly Arg Ser Lys Gly Ile Leu Asn Gly Gln LysSer Thr Gly Asn 405 410 415 420 tcc aaa tct ccc aaa aaa gga ctg aag actcct aaa acc aaa atg aag 1653 Ser Lys Ser Pro Lys Lys Gly Leu Lys Thr ProLys Thr Lys Met Lys 425 430 435 cag atg act ttg ttg gat atg gcc aaa ggcacg cag aag atg aca cga 1701 Gln Met Thr Leu Leu Asp Met Ala Lys Gly ThrGln Lys Met Thr Arg 440 445 450 gcc cca cgg aat tct ggg ggt aca cct aggacc tct agt aaa cct cat 1749 Ala Pro Arg Asn Ser Gly Gly Thr Pro Arg ThrSer Ser Lys Pro His 455 460 465 aaa cat ctg cct cct gca gcc cta cac ctcatt gca tac tac aaa gaa 1797 Lys His Leu Pro Pro Ala Ala Leu His Leu IleAla Tyr Tyr Lys Glu 470 475 480 aac aaa gac agg gag gac aag agg agc gccctg tcc tgt gtt atc tcc 1845 Asn Lys Asp Arg Glu Asp Lys Arg Ser Ala LeuSer Cys Val Ile Ser 485 490 495 500 aaa aca gct cgt ctt ctc tct agt gaagat aga gct cgt ctc cca gaa 1893 Lys Thr Ala Arg Leu Leu Ser Ser Glu AspArg Ala Arg Leu Pro Glu 505 510 515 gaa ttg cga agt ctt gtt caa aaa cgctat gaa ctt cta gag cac aaa 1941 Glu Leu Arg Ser Leu Val Gln Lys Arg TyrGlu Leu Leu Glu His Lys 520 525 530 aag agg tgg gct tct atg tct gaa gaacaa cgg aaa gaa tat ttg aaa 1989 Lys Arg Trp Ala Ser Met Ser Glu Glu GlnArg Lys Glu Tyr Leu Lys 535 540 545 aag aaa cgg gag gag ctg aaa aag aagttg aag gaa aaa gcc aaa gaa 2037 Lys Lys Arg Glu Glu Leu Lys Lys Lys LeuLys Glu Lys Ala Lys Glu 550 555 560 cga aga gag aaa gaa atg ctt gag agatta gaa aaa cag aag cgg tat 2085 Arg Arg Glu Lys Glu Met Leu Glu Arg LeuGlu Lys Gln Lys Arg Tyr 565 570 575 580 gag gac caa gag tta act ggc aaaaac ctt cca gca ttc aga ttg gtg 2133 Glu Asp Gln Glu Leu Thr Gly Lys AsnLeu Pro Ala Phe Arg Leu Val 585 590 595 gat acc cct gaa ggg ctg ccc aacacg ctg ttt ggg gat gtg gcc atg 2181 Asp Thr Pro Glu Gly Leu Pro Asn ThrLeu Phe Gly Asp Val Ala Met 600 605 610 gtg gtg gaa ttc ttg agc tgt tattct ggg cta ctt tta cca gat gct 2229 Val Val Glu Phe Leu Ser Cys Tyr SerGly Leu Leu Leu Pro Asp Ala 615 620 625 cag tat cct att act gct gtg tccctt atg gaa gcc ttg agt gca gat 2277 Gln Tyr Pro Ile Thr Ala Val Ser LeuMet Glu Ala Leu Ser Ala Asp 630 635 640 aag ggt ggc ttt tta tac ctt aacagg gtg ttg gtc atc ctc tta cag 2325 Lys Gly Gly Phe Leu Tyr Leu Asn ArgVal Leu Val Ile Leu Leu Gln 645 650 655 660 acc ctc cta cag acc ctc ctacaa gat gag ata gcc gaa gac tat ggt 2373 Thr Leu Leu Gln Thr Leu Leu GlnAsp Glu Ile Ala Glu Asp Tyr Gly 665 670 675 gaa ttg gga atg aag ctg tcagaa atc ccc ttg act ctg cat tct gtt 2421 Glu Leu Gly Met Lys Leu Ser GluIle Pro Leu Thr Leu His Ser Val 680 685 690 tca gag ctg gtg cgg ctc tgcttg cgc aga tct gat gtt cag gag gaa 2469 Ser Glu Leu Val Arg Leu Cys LeuArg Arg Ser Asp Val Gln Glu Glu 695 700 705 agc gag ggc tca gac aca gatgac aat aaa gat tca gct gca ttt gag 2517 Ser Glu Gly Ser Asp Thr Asp AspAsn Lys Asp Ser Ala Ala Phe Glu 710 715 720 gat aat gag gta caa gat gagttc cta gaa aag ctg gag acc tct gaa 2565 Asp Asn Glu Val Gln Asp Glu PheLeu Glu Lys Leu Glu Thr Ser Glu 725 730 735 740 ttt ttt gag ctg acg tcagag gag aag cta cag att ttg aca gca ctg 2613 Phe Phe Glu Leu Thr Ser GluGlu Lys Leu Gln Ile Leu Thr Ala Leu 745 750 755 tgc cac cgg atc ctc atgaca tac tca gtg caa gac cac atg gag acc 2661 Cys His Arg Ile Leu Met ThrTyr Ser Val Gln Asp His Met Glu Thr 760 765 770 aga cag cag atg tct gcagag ttg tgg aag gaa cgg ctt gct gtg ttg 2709 Arg Gln Gln Met Ser Ala GluLeu Trp Lys Glu Arg Leu Ala Val Leu 775 780 785 aag gaa gaa aat gat aagaag aga gca gag aaa cag aaa cgg aaa gaa 2757 Lys Glu Glu Asn Asp Lys LysArg Ala Glu Lys Gln Lys Arg Lys Glu 790 795 800 atg gaa gcc aaa aat aaagaa aat gga aaa gtt gag aat ggg tta ggc 2805 Met Glu Ala Lys Asn Lys GluAsn Gly Lys Val Glu Asn Gly Leu Gly 805 810 815 820 aaa act gat agg aaaaaa aga att gtg aag ttt gag ccc caa gta gat 2853 Lys Thr Asp Arg Lys LysArg Ile Val Lys Phe Glu Pro Gln Val Asp 825 830 835 aca gaa gct gaa gacatg att agt gct gtg aag agc aga agg ttg ctt 2901 Thr Glu Ala Glu Asp MetIle Ser Ala Val Lys Ser Arg Arg Leu Leu 840 845 850 gcc att caa gct aagaag gaa cgg gaa atc cag gaa aga gaa atg aaa 2949 Ala Ile Gln Ala Lys LysGlu Arg Glu Ile Gln Glu Arg Glu Met Lys 855 860 865 gtg aaa ctg gaa cgccaa gct gaa gaa gaa cga ata cgg aag cac aaa 2997 Val Lys Leu Glu Arg GlnAla Glu Glu Glu Arg Ile Arg Lys His Lys 870 875 880 gca gct gct gag aaagct ttc cag gaa ggg att gcc aag gcc aaa cta 3045 Ala Ala Ala Glu Lys AlaPhe Gln Glu Gly Ile Ala Lys Ala Lys Leu 885 890 895 900 gtc atg cgc aggact cct att ggc aca gat cga aac cat aat aga tac 3093 Val Met Arg Arg ThrPro Ile Gly Thr Asp Arg Asn His Asn Arg Tyr 905 910 915 tgg ctc ttc tcagat gaa gtt cca gga tta ttc att gaa aaa ggc tgg 3141 Trp Leu Phe Ser AspGlu Val Pro Gly Leu Phe Ile Glu Lys Gly Trp 920 925 930 gta cat gac agcatt gac tac cga ttc aac cat cac tgc aaa gac cac 3189 Val His Asp Ser IleAsp Tyr Arg Phe Asn His His Cys Lys Asp His 935 940 945 aca gtc tct ggtgat gag gat tac tgt cct cgc agt aag aaa gca aac 3237 Thr Val Ser Gly AspGlu Asp Tyr Cys Pro Arg Ser Lys Lys Ala Asn 950 955 960 tta ggt aaa aatgca agc atg aac aca caa cat gga aca gca aca gaa 3285 Leu Gly Lys Asn AlaSer Met Asn Thr Gln His Gly Thr Ala Thr Glu 965 970 975 980 gtt gct gtagag aca acc aca ccc aaa caa gga cag aac cta tgg ttt 3333 Val Ala Val GluThr Thr Thr Pro Lys Gln Gly Gln Asn Leu Trp Phe 985 990 995 tta tgt gatagt caa aag gag ctg gat gag ttg cta aac tgt ctt cac 3381 Leu Cys Asp SerGln Lys Glu Leu Asp Glu Leu Leu Asn Cys Leu His 1000 1005 1010 cct caggga ata aga gaa agt caa ctt aaa gag aga cta gag aag agg 3429 Pro Gln GlyIle Arg Glu Ser Gln Leu Lys Glu Arg Leu Glu Lys Arg 1015 1020 1025 taccag gac att att cac tct att cat cta gca cgg aag cca aat ttg 3477 Tyr GlnAsp Ile Ile His Ser Ile His Leu Ala Arg Lys Pro Asn Leu 1030 1035 1040ggt cta aaa tct tgt gat ggc aac cag gag ctt tta aac ttc ctt cgt 3525 GlyLeu Lys Ser Cys Asp Gly Asn Gln Glu Leu Leu Asn Phe Leu Arg 1045 10501055 1060 agt gat ctc att gaa gtt gca aca agg tta caa aaa gga gga cttgga 3573 Ser Asp Leu Ile Glu Val Ala Thr Arg Leu Gln Lys Gly Gly Leu Gly1065 1070 1075 tat gtg gaa gaa aca tca gaa ttt gaa gcc cgg gtc att tcatta gag 3621 Tyr Val Glu Glu Thr Ser Glu Phe Glu Ala Arg Val Ile Ser LeuGlu 1080 1085 1090 aaa ttg aag gat ttt ggt gag tgt gtg att gcc ctt caggcc agt gtc 3669 Lys Leu Lys Asp Phe Gly Glu Cys Val Ile Ala Leu Gln AlaSer Val 1095 1100 1105 ata aag aaa ttt ctc caa ggc ttc atg gct ccc aagcaa aag aga aga 3717 Ile Lys Lys Phe Leu Gln Gly Phe Met Ala Pro Lys GlnLys Arg Arg 1110 1115 1120 aaa ctc caa agt gaa gat tca gca aaa act gaggaa gtg gat gaa gag 3765 Lys Leu Gln Ser Glu Asp Ser Ala Lys Thr Glu GluVal Asp Glu Glu 1125 1130 1135 1140 aag aaa atg gta gag gaa gca aag gttgca tct gca ctg gag aaa tgg 3813 Lys Lys Met Val Glu Glu Ala Lys Val AlaSer Ala Leu Glu Lys Trp 1145 1150 1155 aag aca gca atc cgg gaa gct cagact ttc tcc agg atg cac gtg ctg 3861 Lys Thr Ala Ile Arg Glu Ala Gln ThrPhe Ser Arg Met His Val Leu 1160 1165 1170 ctt ggg atg ctt gat gcc tgtatc aag tgg gat atg tcc gca gaa aat 3909 Leu Gly Met Leu Asp Ala Cys IleLys Trp Asp Met Ser Ala Glu Asn 1175 1180 1185 gct agg tgc aaa gtt tgtcca aag aaa ggt gag gat gac aaa ttg atc 3957 Ala Arg Cys Lys Val Cys ProLys Lys Gly Glu Asp Asp Lys Leu Ile 1190 1195 1200 ttg tgt gat gag tgtaat aaa gcc ttc cac ctg ttt tgt ctg agg ccg 4005 Leu Cys Asp Glu Cys AsnLys Ala Phe His Leu Phe Cys Leu Arg Pro 1205 1210 1215 1220 gcc ctc tatgaa gta cca gat ggt gag tgg cag tgc cca gct tgc cag 4053 Ala Leu Tyr GluVal Pro Asp Gly Glu Trp Gln Cys Pro Ala Cys Gln 1225 1230 1235 ccc gctact gcc agg cgc aac tcc cgt ggc agg aac tat act gaa gag 4101 Pro Ala ThrAla Arg Arg Asn Ser Arg Gly Arg Asn Tyr Thr Glu Glu 1240 1245 1250 tctgct tct gag gac agt gaa gat gat gag agt gat gaa gag gag gag 4149 Ser AlaSer Glu Asp Ser Glu Asp Asp Glu Ser Asp Glu Glu Glu Glu 1255 1260 1265gag gaa gaa gag gag gag gag gaa gaa gat tat gag gtg gct ggt ttg 4197 GluGlu Glu Glu Glu Glu Glu Glu Glu Asp Tyr Glu Val Ala Gly Leu 1270 12751280 cga ttg aga cct cga aag acc atc cgg ggc aag cac agc gtc atc ccc4245 Arg Leu Arg Pro Arg Lys Thr Ile Arg Gly Lys His Ser Val Ile Pro1285 1290 1295 1300 cct gca gca agg tca ggc cgg cgc ccg ggt aag aag ccacac tct acc 4293 Pro Ala Ala Arg Ser Gly Arg Arg Pro Gly Lys Lys Pro HisSer Thr 1305 1310 1315 agg agg tct cag ccc aag gca cca cct gtg gat gatgct gag gtg gat 4341 Arg Arg Ser Gln Pro Lys Ala Pro Pro Val Asp Asp AlaGlu Val Asp 1320 1325 1330 gag ctg gtg ctt cag acc aag cgg agc tcc cggagg caa agc ctg gag 4389 Glu Leu Val Leu Gln Thr Lys Arg Ser Ser Arg ArgGln Ser Leu Glu 1335 1340 1345 ctg cag aag tgt gaa gag atc ctc cac aagatc gtg aag tac cgc ttc 4437 Leu Gln Lys Cys Glu Glu Ile Leu His Lys IleVal Lys Tyr Arg Phe 1350 1355 1360 agc tgg ccc ttc agg gag cct gtg accaga gat gag gcc gag gac tac 4485 Ser Trp Pro Phe Arg Glu Pro Val Thr ArgAsp Glu Ala Glu Asp Tyr 1365 1370 1375 1380 tat gat gtg atc acg cac cccatg gac ttt cag aca gtg cag aac aaa 4533 Tyr Asp Val Ile Thr His Pro MetAsp Phe Gln Thr Val Gln Asn Lys 1385 1390 1395 tgt tcc tgt ggg agc taccgc tct gtg cag gag ttt ctt act gac atg 4581 Cys Ser Cys Gly Ser Tyr ArgSer Val Gln Glu Phe Leu Thr Asp Met 1400 1405 1410 aag caa gtg ttt accaat gct gag gtt tac aac tgc cgt ggc agc cat 4629 Lys Gln Val Phe Thr AsnAla Glu Val Tyr Asn Cys Arg Gly Ser His 1415 1420 1425 gtg cta agc tgcatg gtg aag aca gaa cag tgt cta gtg gtt ctg ttg 4677 Val Leu Ser Cys MetVal Lys Thr Glu Gln Cys Leu Val Val Leu Leu 1430 1435 1440 cat aaa cacctt cct ggc cac cca tat gtc cgc agg aag cgc aag aag 4725 His Lys His LeuPro Gly His Pro Tyr Val Arg Arg Lys Arg Lys Lys 1445 1450 1455 1460 tttcct gat agg ctt gct gaa gat gaa ggg gac agt gag cca gag gcc 4773 Phe ProAsp Arg Leu Ala Glu Asp Glu Gly Asp Ser Glu Pro Glu Ala 1465 1470 1475gtt gga cag tcc agg gac gaa gac aga aga agt aga gag gcg gag att 4821 ValGly Gln Ser Arg Asp Glu Asp Arg Arg Ser Arg Glu Ala Glu Ile 1480 14851490 cag gaa tgg ctc cag gac acg tcc ctt tac tct gcc aag atc aac tca4869 Gln Glu Trp Leu Gln Asp Thr Ser Leu Tyr Ser Ala Lys Ile Asn Ser1495 1500 1505 aaa gac cac aac tgt ttc atg atg ctg gtg aat aca caa ttctgt atg 4917 Lys Asp His Asn Cys Phe Met Met Leu Val Asn Thr Gln Phe CysMet 1510 1515 1520 gca ctc act gat act gtc acc tgagaggaag acgggggaagagacagagta 4968 Ala Leu Thr Asp Thr Val Thr 1525 1530 tgggcttaaagaaacaagac tgtataataa atacagatta aaaaagaaaa atcgccacca 5028 tctcccctgttggcctgatt accccgatcc tgctatgtaa cacagcaatc cctcccctgg 5088 agaccagaggggcttggcac tgtggtggaa gccagaacga gcaggccctt aggaaagaag 5148 gcaggaacaggaactggctt caccagaaaa gctagaccct cggactcctc ctggaaactc 5208 tcagaagggagggttatggc cctctttgtc ccttcatatt tctggacaaa gaccaccaac 5268 ccaatatcaagccccataaa gagcttttag aaaaacagca taagcttggg atgacaggcg 5328 tttctggactccctgtgatc tcttccaggt tcttggtctt cctcgctcgc ctccctccca 5388 ccctccctagctgtcccccc acctcagctc ccttaccacg gccctgcctc tctacttctt 5448 ctgtcttcgtcccctggact gtccaacggc ctctggctca ctgtcccttc atcttcagca 5508 acctatcaggaaacttcttg cgcttcctgc ggacatatgg gtggccagga agtgtttatg 5568 caaca 5573<210> SEQ ID NO 31 <211> LENGTH: 22 <212> TYPE: DNA <213> ORGANISM:Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 31 tggatgatgc tgaggtggatga 22 <210> SEQ ID NO 32 <211> LENGTH: 22 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 32 ggggtgcgtg atgacatcatag 22 <210> SEQ ID NO 33 <211> LENGTH: 24 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 33 gagtgcagat aagggtggcttttt 24 <210> SEQ ID NO 34 <211> LENGTH: 24 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 34 ccaattcacc atagtcttcggcta 24 <210> SEQ ID NO 35 <211> LENGTH: 22 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 35 gaaacgggag gagctgaaaaag 22 <210> SEQ ID NO 36 <211> LENGTH: 22 <212> TYPE: DNA <213>ORGANISM: Artificial Sequence <220> FEATURE: <223> OTHER INFORMATION:Synthetically generated primer <400> SEQUENCE: 36 ccttcagggg tatccaccaatc 22 <210> SEQ ID NO 37 <211> LENGTH: 59 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 37 Ser Trp Pro Phe Leu Lys LeuVal Ser Lys Ile Gln Val Pro Asp Tyr 1 5 10 15 Tyr Asp Ile Ile Lys LysPro Ile Ala Leu Asn Ile Ile Arg Glu Lys 20 25 30 Val Asn Lys Cys Glu TyrLys Leu Ala Ser Glu Phe Ile Asp Asp Ile 35 40 45 Glu Leu Met Phe Ser AsnCys Phe Glu Tyr Asn 50 55 <210> SEQ ID NO 38 <211> LENGTH: 59 <212>TYPE: PRT <213> ORGANISM: Saccharomyces cerevisiae <400> SEQUENCE: 38Ser Trp Pro Phe Leu Lys Pro Val Asn Lys Lys Gln Val Lys Asp Tyr 1 5 1015 Tyr Thr Val Ile Lys Arg Pro Met Asp Ile Glu Ile Ile Gly Lys Asn 20 2530 Ile Glu Ala His Arg Tyr His Ser Arg Ala Glu Tyr Leu Ala Asp Ile 35 4045 Glu Leu Ile Ala Thr Asn Cys Glu Gln Tyr Asn 50 55 <210> SEQ ID NO 39<211> LENGTH: 59 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400>SEQUENCE: 39 Thr Tyr Pro Phe His Thr Pro Val Asn Ala Lys Val Val Lys AspTyr 1 5 10 15 Tyr Lys Ile Ile Thr Arg Pro Met Asp Leu Gln Ile Leu ArgGlu Asn 20 25 30 Val Arg Lys Arg Leu Tyr Pro Ser Arg Glu Glu Phe Arg GluHis Leu 35 40 45 Glu Leu Ile Val Lys Asn Ser Ala Thr Tyr Asn 50 55 <210>SEQ ID NO 40 <211> LENGTH: 59 <212> TYPE: PRT <213> ORGANISM: Homosapiens <400> SEQUENCE: 40 Ala Trp Pro Phe Met Glu Pro Val Lys Arg ThrGlu Ala Pro Gly Tyr 1 5 10 15 Tyr Glu Val Ile Arg Ser Pro Met Asp LeuLys Ile Met Ser Glu Arg 20 25 30 Leu Lys Asn Arg Tyr Tyr Val Ser Lys LysLeu Phe Met Ala Asp Leu 35 40 45 Gln Arg Val Phe Thr Asn Cys Lys Glu TyrAsn 50 55 <210> SEQ ID NO 41 <211> LENGTH: 61 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 41 Ser Leu Pro Phe Arg Gln ProVal Asp Pro Gln Leu Leu Gly Ile Pro 1 5 10 15 Asp Tyr Glu Asp Ile ValLys Asn Pro Met Asp Leu Ser Ile Ile Lys 20 25 30 Arg Lys Leu Asp Thr GlyGln Tyr Gln Glu Pro Trp Gln Tyr Val Asp 35 40 45 Asp Val Trp Leu Met PheAsn Asn Ala Trp Leu Tyr Asn 50 55 60 <210> SEQ ID NO 42 <211> LENGTH: 45<212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 42 Cys LysIle Cys Arg Lys Lys Gly Asp Ala Glu Asn Met Val Leu Cys 1 5 10 15 AspGly Cys Asp Arg Gly His His Thr Tyr Cys Val Arg Pro Lys Leu 20 25 30 LysThr Val Pro Glu Gly Asp Trp Phe Cys Pro Glu Cys 35 40 45 <210> SEQ ID NO43 <211> LENGTH: 44 <212> TYPE: PRT <213> ORGANISM: Caenorhabditiselegans <400> SEQUENCE: 43 Cys Gln Ile Cys Lys Ser Met Asp Gly Asp GluMet Leu Val Cys Asp 1 5 10 15 Gly Cys Glu Ser Gly Cys His Met Glu CysPhe Arg Pro Arg Met Thr 20 25 30 Lys Val Pro Glu Gly Asp Trp Phe Cys GlnArg Cys 35 40 <210> SEQ ID NO 44 <211> LENGTH: 45 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 44 Cys Met Phe Cys Gly Arg GlyAsn Asn Glu Asp Lys Leu Leu Leu Cys 1 5 10 15 Asp Gly Cys Asp Asp SerTyr His Thr Phe Cys Leu Ile Pro Pro Leu 20 25 30 Pro Asp Val Pro Lys GlyAsp Trp Arg Cys Pro Lys Cys 35 40 45 <210> SEQ ID NO 45 <211> LENGTH: 54<212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 45 Cys SerPhe Cys Leu Gly Thr Lys Glu Gln Asn Arg Glu Lys Lys Pro 1 5 10 15 GluGlu Leu Ile Ser Cys Ala Asp Cys Gly Asn Ser Gly His Pro Ser 20 25 30 CysLeu Lys Phe Ser Pro Glu Leu Thr Val Arg Val Lys Ala Leu Arg 35 40 45 TrpGln Cys Ile Glu Cys 50 <210> SEQ ID NO 46 <211> LENGTH: 46 <212> TYPE:PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 46 Cys Ser Ser Cys ArgAsp Gln Gly Lys Asn Ala Asp Asn Met Leu Phe 1 5 10 15 Cys Asp Ser CysAsp Arg Gly Phe His Met Glu Cys Cys Asp Pro Pro 20 25 30 Leu Thr Arg MetPro Lys Gly Met Trp Ile Cys Gln Ile Cys 35 40 45 <210> SEQ ID NO 47<211> LENGTH: 75 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400>SEQUENCE: 47 Cys Glu Lys Cys Phe Asn Glu Ile Gln Gly Glu Ser Val Ser LeuGly 1 5 10 15 Asp Asp Pro Ser Gln Pro Gln Thr Thr Ile Asn Asn Glu GlnPhe Ser 20 25 30 Lys Arg Lys Asn Asp Thr Leu Asp Pro Glu Leu Phe Val GluCys Thr 35 40 45 Glu Cys Gly Arg Lys Met His Gln Ile Cys Val Leu His HisGlu Ile 50 55 60 Ile Trp Pro Ala Gly Phe Val Cys Asp Gly Cys 65 70 75<210> SEQ ID NO 48 <211> LENGTH: 75 <212> TYPE: PRT <213> ORGANISM: Homosapiens <400> SEQUENCE: 48 Cys Glu Lys Cys Phe Thr Glu Ile Gln Gly GluAsn Val Thr Leu Gly 1 5 10 15 Asp Asp Pro Ser Gln Pro Gln Thr Thr IleSer Lys Asp Gln Phe Glu 20 25 30 Lys Lys Lys Asn Asp Thr Leu Asp Pro GluPro Phe Val Asp Cys Lys 35 40 45 Glu Cys Gly Arg Lys Met His Gln Ile CysVal Leu His Tyr Asp Ile 50 55 60 Ile Trp Pro Ser Gly Phe Val Cys Asp AsnCys 65 70 75 <210> SEQ ID NO 49 <211> LENGTH: 59 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 49 Ser Trp Pro Phe Leu Lys LeuVal Ser Lys Ile Gln Val Pro Asp Tyr 1 5 10 15 Tyr Asp Ile Ile Lys LysPro Ile Ala Leu Asn Ile Ile Arg Glu Lys 20 25 30 Val Asn Lys Cys Glu TyrLys Leu Ala Ser Glu Phe Ile Asp Asp Ile 35 40 45 Glu Leu Met Phe Ser AsnCys Phe Glu Tyr Asn 50 55 <210> SEQ ID NO 50 <211> LENGTH: 59 <212>TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 50 Ser Trp ProPhe His His Pro Val Asn Lys Lys Phe Val Pro Asp Tyr 1 5 10 15 Tyr LysVal Ile Val Asn Pro Met Asp Ile Glu Thr Ile Arg Lys Asn 20 25 30 Ile SerLys His Lys Tyr Gln Ser Arg Glu Ser Phe Leu Asp Asp Val 35 40 45 Asn LeuIle Leu Ala Asn Ser Val Lys Tyr Asn 50 55 <210> SEQ ID NO 51 <211>LENGTH: 59 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE:51 Ala Trp Pro Phe Leu Glu Pro Val Asn Pro Arg Leu Val Ser Gly Tyr 1 510 15 Arg Arg Ile Ile Lys Asn Pro Met Asp Phe Ser Thr Met Arg Glu Arg 2025 30 Leu Leu Arg Gly Gly Tyr Thr Ser Ser Glu Glu Phe Ala Ala Asp Ala 3540 45 Leu Leu Val Phe Asp Asn Cys Gln Thr Phe Asn 50 55 <210> SEQ ID NO52 <211> LENGTH: 59 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400>SEQUENCE: 52 Ala Trp Pro Phe Met Glu Pro Val Lys Arg Thr Glu Ala Pro GlyTyr 1 5 10 15 Tyr Glu Val Ile Arg Ser Pro Met Asp Leu Lys Thr Met SerGlu Arg 20 25 30 Leu Lys Asn Arg Tyr Tyr Val Ser Lys Lys Leu Phe Met AlaAsp Leu 35 40 45 Gln Arg Val Phe Thr Asn Cys Lys Glu Tyr Asn 50 55 <210>SEQ ID NO 53 <211> LENGTH: 82 <212> TYPE: PRT <213> ORGANISM:Caenorhabditis elegans <400> SEQUENCE: 53 Ala Leu Pro Phe Leu Glu ProVal Asn Pro Lys Leu Val Pro Gly Tyr 1 5 10 15 Lys Met Ile Ile Ser LysPro Met Asp Leu Lys Thr Ile Arg Gln Lys 20 25 30 Asn Glu Lys Leu Ile ValSer Glu Thr Tyr Gln Phe Cys Phe Phe Ala 35 40 45 Ile Phe Asp Leu Lys LeuLys Met Lys Ile Thr Gln Tyr Glu Thr Pro 50 55 60 Glu Asp Phe Ala Glu AspIle Glu Leu Met Phe Ala Asn Cys Arg Gln 65 70 75 80 Phe Asn <210> SEQ IDNO 54 <211> LENGTH: 61 <212> TYPE: PRT <213> ORGANISM: Homo sapiens<400> SEQUENCE: 54 Ser Leu Pro Phe Arg Gln Pro Val Asp Pro Gln Leu LeuGly Ile Pro 1 5 10 15 Asp Tyr Phe Asp Ile Val Lys Met Pro Met Asp LeuSer Thr Ile Lys 20 25 30 Arg Lys Leu Asp Thr Gly Gln Tyr Gln Glu Pro TrpGln Tyr Val Asp 35 40 45 Asp Val Trp Leu Met Phe Asn Asn Ala Trp Leu TyrAsn 50 55 60 <210> SEQ ID NO 55 <211> LENGTH: 45 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 55 Cys Lys Ile Cys Arg Lys LysGly Asp Ala Glu Asn Met Val Leu Cys 1 5 10 15 Asp Gly Cys Asp Arg GlyHis His Thr Tyr Cys Val Arg Pro Lys Leu 20 25 30 Lys Thr Val Pro Glu GlyAsp Trp Phe Cys Pro Glu Cys 35 40 45 <210> SEQ ID NO 56 <211> LENGTH: 44<212> TYPE: PRT <213> ORGANISM: Caenorhabditis elegans <400> SEQUENCE:56 Cys Gln Ile Cys Lys Ser Met Asp Gly Asp Glu Met Leu Val Cys Asp 1 510 15 Gly Cys Glu Ser Gly Cys His Met Glu Cys Phe Arg Pro Arg Met Thr 2025 30 Lys Val Pro Glu Gly Asp Trp Phe Cys Gln Arg Cys 35 40 <210> SEQ IDNO 57 <211> LENGTH: 44 <212> TYPE: PRT <213> ORGANISM: Homo sapiens<400> SEQUENCE: 57 Cys Met Phe Cys Gly Arg Gly Asn Asn Glu Asp Lys LeuLeu Leu Cys 1 5 10 15 Asp Gly Cys Asp Ser Tyr His Thr Phe Cys Leu IlePro Pro Leu Pro 20 25 30 Asp Val Pro Lys Gly Asp Trp Arg Cys Pro Lys Cys35 40 <210> SEQ ID NO 58 <211> LENGTH: 54 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 58 Cys Ser Phe Cys Leu Gly ThrLys Glu Gln Asn Arg Glu Lys Lys Pro 1 5 10 15 Glu Glu Leu Ile Ser CysAla Asp Cys Gly Asn Ser Gly His Pro Ser 20 25 30 Cys Leu Lys Phe Ser ProGlu Leu Thr Val Arg Val Lys Ala Leu Arg 35 40 45 Trp Gln Cys Ile Glu Cys50 <210> SEQ ID NO 59 <211> LENGTH: 46 <212> TYPE: PRT <213> ORGANISM:Homo sapiens <400> SEQUENCE: 59 Cys Ser Ser Cys Arg Asp Gln Gly Lys AsnAla Asp Asn Met Leu Phe 1 5 10 15 Cys Asp Ser Cys Asp Arg Gly Phe HisMet Glu Cys Cys Asp Pro Pro 20 25 30 Leu Thr Arg Met Pro Lys Gly Met TrpIle Cys Gln Ile Cys 35 40 45 <210> SEQ ID NO 60 <211> LENGTH: 75 <212>TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 60 Cys Glu LysCys Phe Asn Glu Ile Gln Gly Glu Ser Val Ser Leu Gly 1 5 10 15 Asp AspPro Ser Gln Pro Gln Thr Thr Ile Asn Lys Glu Gln Phe Ser 20 25 30 Lys ArgLys Asn Asp Thr Leu Asp Pro Glu Leu Phe Val Glu Cys Thr 35 40 45 Glu CysGly Arg Lys Met His Gln Ile Cys Val Leu His His Glu Ile 50 55 60 Ile TrpPro Ala Gly Phe Val Cys Asp Gly Cys 65 70 75 <210> SEQ ID NO 61 <211>LENGTH: 45 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE:61 Cys Leu Val Cys Arg Lys Gly Asp Asn Asp Arg Phe Leu Leu Leu Cys 1 510 15 Asp Gly Cys Asp Arg Gly Cys His Ile Tyr Cys His Arg Pro Lys Met 2025 30 Glu Ala Val Pro Glu Gly Asp Trp Phe Cys Thr Val Cys 35 40 45 <210>SEQ ID NO 62 <211> LENGTH: 211 <212> TYPE: PRT <213> ORGANISM: Homosapiens <400> SEQUENCE: 62 Arg Glu Glu Lys Arg Lys Tyr Val Glu Tyr LeuLys Gln Trp Ser Lys 1 5 10 15 Pro Arg Glu Asp Met Glu Cys Asp Asp LeuLys Glu Leu Pro Glu Pro 20 25 30 Thr Pro Val Lys Thr Arg Leu Pro Pro GluIle Phe Gly Asp Ala Leu 35 40 45 Met Val Leu Glu Phe Leu Asn Ala Phe GlyGlu Leu Phe Asp Leu Gln 50 55 60 Asp Glu Phe Pro Asp Gly Val Thr Leu GluVal Leu Glu Glu Ala Leu 65 70 75 80 Val Gly Asn Asp Ser Glu Gly Pro LeuCys Glu Leu Leu Phe Phe Phe 85 90 95 Leu Thr Ala Ile Phe Gln Ala Ile AlaGlu Glu Glu Glu Glu Val Ala 100 105 110 Lys Glu Gln Leu Thr Asp Ala AspThr Lys Gly Cys Ser Leu Lys Ser 115 120 125 Leu Asp Leu Asp Ser Cys ThrLeu Ser Glu Ile Leu Arg Leu His Ile 130 135 140 Leu Ala Ser Gly Ala AspVal Thr Ser Ala Asn Ala Lys Tyr Arg Tyr 145 150 155 160 Gln Lys Arg GlyGly Phe Asp Ala Thr Asp Asp Ala Cys Met Glu Leu 165 170 175 Arg Leu SerAsn Pro Ser Leu Val Lys Lys Leu Ser Ser Thr Ser Val 180 185 190 Tyr AspLeu Thr Pro Gly Glu Lys Met Lys Ile Leu His Ala Leu Cys 195 200 205 GlyLys Leu 210 <210> SEQ ID NO 63 <211> LENGTH: 185 <212> TYPE: PRT <213>ORGANISM: Caenorhabditis elegans <400> SEQUENCE: 63 Leu Asn Asp Glu PheThr Glu Glu Leu Val His Ser Gln Ile Met Ser 1 5 10 15 Asn Gly Val AspGlu Cys Lys Ile Arg Glu Arg Glu Ala Asp Asp Leu 20 25 30 Leu Val Asn IleAsn Asp Val Arg His Leu Pro Asp Phe Ser Arg Ile 35 40 45 Gly Asn Gln CysLeu Ser Ser Gln Gly Phe Ala Asp Ala Leu Met Val 50 55 60 His Glu Phe ValGln Asn Phe Gly His Val Leu Gly Ile Asp Leu Glu 65 70 75 80 Ile Ala ProLys Leu Glu Ser Leu Cys Ala Gly Leu Asp Gly Asp Ala 85 90 95 Asn His AlaGlu Gln Thr Leu Gln Leu Thr Arg Gln Leu Leu Arg Leu 100 105 110 Ala LeuGlu Phe Pro Gly Met Gly Asn Glu Lys Arg Phe Gly Gln Gly 115 120 125 GlyGly Glu Met Gly Leu Asp Arg Glu Asn Phe Ser Glu Val Met Arg 130 135 140Leu Phe Leu Ile Asp Lys Gly Lys Arg Gly Glu Glu Leu Ser Gln Pro 145 150155 160 Leu Leu Thr Cys Asn Phe Leu Ser Ile Ser Pro Glu Gln Lys Ala Ser165 170 175 Ile Leu Ala Phe Leu Cys Asp Glu Leu 180 185 <210> SEQ ID NO64 <211> LENGTH: 175 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400>SEQUENCE: 64 Leu Glu Glu Arg Gln Lys Gln Gln Met Ile Leu Glu Glu Met LysLys 1 5 10 15 Pro Thr Glu Asp Met Cys Leu Thr Asp His Gln Pro Leu ProAsp Phe 20 25 30 Ser Arg Val Pro Gly Leu Thr Leu Pro Ser Gly Ala Phe SerAsp Cys 35 40 45 Leu Thr Ile Val Glu Phe Leu His Ser Phe Gly Lys Val LeuGly Phe 50 55 60 Asp Pro Ala Lys Asp Val Pro Ser Leu Gly Val Leu Gln GluGly Leu 65 70 75 80 Leu Cys Gln Gly Asp Ser Leu Gly Glu Val Gln Asp LeuLeu Val Arg 85 90 95 Leu Leu Lys Ala Ala Leu His Asp Pro Gly Phe Pro SerTyr Cys Gln 100 105 110 Ser Leu Lys Ile Leu Gly Glu Lys Val Ser Glu IlePro Leu Thr Arg 115 120 125 Asp Asn Val Ser Glu Ile Leu Arg Cys Pro LeuMet Ala Tyr Gly Val 130 135 140 Glu Pro Ala Leu Cys Asp Arg Leu Arg ThrGln Pro Pro Gln Ala Gln 145 150 155 160 Pro Pro Gln Lys Ala Ala Val LeuAla Phe Pro Val His Glu Leu 165 170 175 <210> SEQ ID NO 65 <211> LENGTH:176 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 65 LeuGlu Gln Arg Arg Leu Glu Leu Glu Met Ala Lys Glu Leu Lys Lys 1 5 10 15Pro Asn Glu Asp Met Cys Leu Ala Asp Gln Lys Pro Leu Pro Glu Leu 20 25 30Pro Arg Ile Pro Gly Leu Val Leu Ser Gly Ser Thr Phe Ser Asp Cys 35 40 45Leu Met Val Val Gln Phe Leu Arg Asn Phe Gly Lys Val Leu Gly Phe 50 55 60Asp Val Asn Ile Asp Val Pro Asn Leu Ser Val Leu Gln Glu Gly Leu 65 70 7580 Leu Asn Ile Gly Asp Ser Met Gly Glu Val Gln Asp Leu Leu Val Arg 85 9095 Leu Leu Ser Ala Ala Val Cys Asp Pro Gly Leu Ile Thr Gly Tyr Lys 100105 110 Ala Lys Thr Ala Leu Gly Glu His Leu Leu Asn Val Gly Val Asn Arg115 120 125 Asp Asn Val Ser Glu Ile Leu Gln Ile Phe Met Glu Ala His CysGly 130 135 140 Gln Thr Glu Leu Thr Glu Ser Leu Lys Thr Lys Ala Phe GlnAla His 145 150 155 160 Thr Pro Ala Gln Lys Ala Ser Val Leu Ala Phe LeuIle Asn Glu Leu 165 170 175 <210> SEQ ID NO 66 <211> LENGTH: 59 <212>TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 66 Tyr Pro IleThr Ala Val Ser Leu Met Glu Ala Leu Ser Ala Asp Lys 1 5 10 15 Gly GlyPhe Leu Tyr Leu Asn Arg Val Leu Val Ile Leu Leu Gln Thr 20 25 30 Leu LeuGln Asp Glu Ile Ala Glu Asp Tyr Gly Glu Leu Gly Met Lys 35 40 45 Leu SerGlu Ile Pro Leu His Ser Val Ser Glu 50 55 <210> SEQ ID NO 67 <211>LENGTH: 65 <212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE:67 Tyr Pro Ile Thr Ala Val Ser Leu Met Glu Ala Leu Ser Ala Asp Lys 1 510 15 Gly Gly Phe Leu Tyr Leu Asn Arg Val Leu Val Ile Leu Leu Gln Thr 2025 30 Leu Leu Gln Thr Leu Leu Gln Asp Glu Ile Ala Glu Asp Tyr Gly Glu 3540 45 Leu Gly Met Lys Leu Ser Lys Ile Pro Leu Thr Leu His Ser Val Ser 5055 60 Glu 65 <210> SEQ ID NO 68 <211> LENGTH: 1525 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 68 Met Ala Pro Leu Leu Gly ArgLys Pro Phe Pro Leu Val Asn Pro Leu 1 5 10 15 Pro Gly Glu Glu Pro PhePhe Thr Ile Pro His Thr Gln Glu Ala Phe 20 25 30 Arg Thr Arg Glu Glu TyrGlu Ala Arg Leu Glu Arg Tyr Ser Glu Arg 35 40 45 Ile Trp Thr Cys Lys SerThr Gly Ser Ser Gln Leu Thr His Lys Glu 50 55 60 Ala Trp Glu Glu Glu GlnGlu Val Ala Glu Leu Leu Lys Glu Glu Phe 65 70 75 80 Pro Ala Trp Tyr GluLys Leu Val Leu Glu Met Val His His Asn Thr 85 90 95 Ala Ser Leu Glu LysLeu Val Asp Thr Ala Trp Leu Glu Ile Met Thr 100 105 110 Lys Tyr Ala ValGly Glu Glu Cys Asp Phe Glu Val Gly Lys Glu Lys 115 120 125 Met Leu LysVal Lys Ile Val Lys Ile His Pro Leu Glu Lys Val Asp 130 135 140 Glu GluAla Thr Glu Lys Lys Ser Asp Gly Ala Cys Asp Ser Pro Ser 145 150 155 160Ser Asp Lys Glu Asn Ser Ser Gln Ile Ala Gln Asp His Gln Lys Lys 165 170175 Glu Thr Val Val Lys Glu Asp Glu Gly Arg Arg Glu Ser Ile Asn Asp 180185 190 Arg Ala Arg Arg Ser Pro Arg Lys Leu Pro Thr Ser Leu Lys Lys Gly195 200 205 Glu Arg Lys Trp Ala Pro Pro Lys Phe Leu Pro His Lys Tyr AspVal 210 215 220 Lys Leu Gln Asn Glu Asp Lys Ile Ile Ser Asn Val Pro AlaAsp Ser 225 230 235 240 Leu Ile Arg Thr Glu Arg Pro Pro Asn Lys Glu IleVal Arg Tyr Phe 245 250 255 Ile Arg His Asn Ala Leu Arg Ala Gly Thr GlyGlu Asn Ala Pro Trp 260 265 270 Val Val Glu Asp Glu Leu Val Lys Lys TyrSer Leu Pro Ser Lys Phe 275 280 285 Ser Asp Phe Leu Leu Asp Pro Tyr LysTyr Met Thr Leu Asn Pro Ser 290 295 300 Thr Lys Arg Lys Asn Thr Gly SerPro Asp Arg Lys Pro Ser Lys Lys 305 310 315 320 Ser Lys Thr Asp Asn SerSer Leu Ser Ser Pro Leu Asn Pro Lys Leu 325 330 335 Trp Cys His Val HisLeu Lys Lys Ser Leu Ser Gly Ser Pro Leu Lys 340 345 350 Val Lys Asn SerLys Asn Ser Lys Ser Pro Glu Glu His Leu Glu Glu 355 360 365 Met Met LysMet Met Ser Pro Asn Lys Leu His Thr Asn Pro His Ile 370 375 380 Pro LysLys Gly Pro Pro Ala Lys Lys Pro Gly Lys His Ser Asp Lys 385 390 395 400Pro Leu Lys Ala Lys Gly Arg Ser Lys Gly Ile Leu Asn Gly Gln Lys 405 410415 Ser Thr Gly Asn Ser Lys Ser Pro Lys Lys Gly Leu Lys Thr Pro Lys 420425 430 Thr Lys Met Lys Gln Met Thr Leu Leu Asp Met Ala Lys Gly Thr Gln435 440 445 Lys Met Thr Arg Ala Pro Arg Asn Ser Gly Gly Thr Pro Arg ThrSer 450 455 460 Ser Lys Pro His Lys His Leu Pro Pro Ala Ala Leu His LeuIle Ala 465 470 475 480 Tyr Tyr Lys Glu Asn Lys Asp Arg Glu Asp Lys ArgSer Ala Leu Ser 485 490 495 Cys Val Ile Ser Lys Thr Ala Arg Leu Leu SerSer Glu Asp Arg Ala 500 505 510 Arg Leu Pro Glu Glu Leu Arg Ser Leu ValGln Lys Arg Tyr Glu Leu 515 520 525 Leu Glu His Lys Lys Arg Trp Ala SerMet Ser Glu Glu Gln Arg Lys 530 535 540 Glu Tyr Leu Lys Lys Lys Arg GluGlu Leu Lys Lys Lys Leu Lys Glu 545 550 555 560 Lys Ala Lys Glu Arg ArgGlu Lys Glu Met Leu Glu Arg Leu Glu Lys 565 570 575 Gln Lys Arg Tyr GluAsp Gln Glu Leu Thr Gly Lys Asn Leu Pro Ala 580 585 590 Phe Arg Leu ValAsp Thr Pro Glu Gly Leu Pro Asn Thr Leu Phe Gly 595 600 605 Asp Val AlaMet Val Val Glu Phe Leu Ser Cys Tyr Ser Gly Leu Leu 610 615 620 Leu ProAsp Ala Gln Tyr Pro Ile Thr Ala Val Ser Leu Met Glu Ala 625 630 635 640Leu Ala Asp Lys Gly Gly Phe Leu Tyr Leu Asn Arg Val Leu Val Ile 645 650655 Leu Leu Gln Thr Leu Leu Gln Asp Glu Ile Ala Glu Asp Tyr Gly Glu 660665 670 Leu Gly Met Lys Leu Ser Glu Ile Pro Leu Thr Leu His Ser Val Ser675 680 685 Glu Leu Val Arg Leu Cys Leu Arg Arg Ser Asp Val Gln Glu GluSer 690 695 700 Glu Gly Ser Asp Thr Asp Asp Asn Lys Asp Ser Ala Ala PheGlu Asp 705 710 715 720 Asn Glu Val Gln Asp Glu Phe Leu Glu Lys Leu GluThr Ser Glu Phe 725 730 735 Phe Glu Leu Thr Ser Glu Glu Lys Leu Gln IleLeu Thr Ala Leu Cys 740 745 750 His Arg Ile Leu Met Thr Tyr Ser Val GlnAsp His Met Glu Thr Arg 755 760 765 Gln Gln Met Ser Ala Glu Leu Trp LysGlu Arg Leu Ala Val Leu Lys 770 775 780 Glu Glu Asn Asp Lys Lys Arg AlaGlu Lys Gln Lys Arg Lys Glu Met 785 790 795 800 Glu Ala Lys Asn Lys GluAsn Gly Lys Val Glu Asn Gly Leu Gly Lys 805 810 815 Thr Asp Arg Lys LysArg Ile Val Lys Phe Glu Pro Gln Val Asp Thr 820 825 830 Glu Ala Glu AspMet Ile Ser Ala Val Lys Ser Arg Arg Leu Leu Ala 835 840 845 Ile Gln AlaLys Lys Glu Arg Glu Ile Gln Glu Arg Glu Met Lys Val 850 855 860 Lys LeuGlu Arg Gln Ala Glu Glu Glu Arg Ile Arg Lys His Lys Ala 865 870 875 880Ala Ala Glu Lys Ala Phe Gln Glu Gly Ile Ala Lys Ala Lys Leu Val 885 890895 Met Arg Arg Thr Pro Ile Gly Thr Asp Arg Asn His Asn Arg Tyr Trp 900905 910 Leu Phe Ser Asp Glu Val Pro Gly Leu Phe Ile Glu Lys Gly Trp Val915 920 925 His Asp Ser Ile Asp Tyr Arg Phe Asn His His Cys Lys Asp HisThr 930 935 940 Val Ser Gly Cys Glu Asp Tyr Cys Pro Arg Ser Lys Lys AlaAsn Leu 945 950 955 960 Gly Lys Asn Ala Ser Met Asn Thr Gln His Gly ThrAla Thr Glu Val 965 970 975 Ala Val Glu Thr Thr Thr Pro Lys Gln Gly GlnAsn Leu Trp Phe Leu 980 985 990 Cys Asp Ser Gln Lys Glu Leu Asp Glu LeuLeu Asn Cys Leu His Pro 995 1000 1005 Gln Gly Ile Arg Glu Ser Gln LeuLys Glu Arg Leu Glu Lys Arg Tyr 1010 1015 1020 Gln Asp Ile Ile His SerLeu His Leu Ala Arg Lys Pro Asn Leu Gly 1025 1030 1035 1040 Leu Lys SerCys Asp Gly Asn Gln Glu Leu Leu Asn Phe Leu Arg Ser 1045 1050 1055 AspLeu Ile Glu Val Ala Thr Arg Leu Gln Lys Gly Gly Leu Gly Tyr 1060 10651070 Val Glu Glu Thr Ser Glu Phe Glu Ala Arg Val Ile Ser Leu Glu Lys1075 1080 1085 Leu Lys Asp Phe Gly Glu Cys Val Ile Ala Leu Gln Ala SerVal Ile 1090 1095 1100 Lys Lys Phe Leu Gln Gly Phe Met Ala Pro Lys GlnLys Arg Arg Lys 1105 1110 1115 1120 Leu Gln Ser Glu Asp Ser Ala Lys ThrGlu Glu Val Asp Glu Glu Lys 1125 1130 1135 Lys Met Val Glu Glu Ala LysVal Ala Ser Ala Leu Glu Lys Trp Lys 1140 1145 1150 Thr Ala Ile Arg GluAla Gln Thr Phe Ser Arg Met His Val Leu Leu 1155 1160 1165 Gly Met LeuAsp Ala Cys Ile Lys Trp Asp Met Ser Ala Glu Asn Ala 1170 1175 1180 ArgCys Lys Val Cys Pro Lys Lys Gly Glu Asp Asp Lys Leu Ile Leu 1185 11901195 1200 Cys Asp Glu Cys Asn Lys Ala Phe His Leu Phe Cys Leu Arg ProAla 1205 1210 1215 Leu Tyr Glu Val Pro Asp Gly Glu Trp Gln Cys Pro AlaCys Gln Pro 1220 1225 1230 Ala Thr Ala Arg Arg Asn Ser Arg Gly Arg AsnTyr Thr Glu Glu Ser 1235 1240 1245 Ala Ser Glu Asp Ser Glu Asp Glu SerAsp Glu Glu Glu Glu Glu Glu 1250 1255 1260 Glu Glu Glu Glu Glu Glu GluAsp Tyr Glu Val Ala Gly Leu Arg Leu 1265 1270 1275 1280 Arg Pro Arg LysThr Ile Arg Gly Lys His Ser Val Ile Pro Pro Ala 1285 1290 1295 Ala ArgSer Gly Arg Arg Pro Gly Lys Lys Pro His Ser Thr Arg Arg 1300 1305 1310Ser Gln Pro Lys Ala Pro Pro Val Asp Ala Glu Val Asp Glu Leu Val 13151320 1325 Leu Gln Thr Lys Arg Ser Ser Arg Arg Gln Ser Leu Glu Leu GlnLys 1330 1335 1340 Cys Glu Glu Ile Leu His Lys Ile Val Lys Tyr Arg PheSer Trp Pro 1345 1350 1355 1360 Phe Arg Glu Pro Val Thr Arg Asp Glu AlaGlu Asp Tyr Tyr Asp Val 1365 1370 1375 Ile Thr His Pro Met Asp Phe GlnThr Val Gln Asn Lys Cys Ser Cys 1380 1385 1390 Gly Ser Tyr Arg Ser ValGln Glu Phe Leu Thr Asp Met Lys Gln Val 1395 1400 1405 Phe Thr Asn AlaGlu Val Tyr Asn Cys Arg Gly Ser His Val Leu Ser 1410 1415 1420 Cys MetVal Lys Thr Glu Gln Cys Leu Val Val Leu Leu His Lys His 1425 1430 14351440 Leu Pro Gly His Pro Tyr Val Arg Arg Lys Arg Lys Lys Phe Pro Asp1445 1450 1455 Arg Leu Ala Glu Asp Glu Gly Asp Ser Glu Pro Glu Ala ValGly Gln 1460 1465 1470 Ser Arg Asp Glu Asp Arg Arg Ser Arg Glu Ala GluIle Gln Glu Trp 1475 1480 1485 Leu Gln Asp Thr Ser Leu Tyr Ala Ser AlaLys Ile Asn Ser Lys Asp 1490 1495 1500 His Asn Cys Phe Met Met Leu ValAsn Thr Gln Phe Cys Met Ala Leu 1505 1510 1515 1520 Thr Asp Thr Val Thr1525 <210> SEQ ID NO 69 <211> LENGTH: 1673 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 69 Met Glu Asp Ala Ser Glu SerSer Arg Gly Val Ala Pro Leu Ile Asn 1 5 10 15 Asn Val Val Leu Pro GlySer Pro Leu Ser Leu Pro Val Ser Val Thr 20 25 30 Gly Cys Lys Ser His ArgVal Ala Asn Lys Lys Val Glu Ala Arg Ser 35 40 45 Glu Lys Leu Leu Pro ThrAla Leu Pro Pro Ser Glu Pro Lys Val Asp 50 55 60 Gln Lys Leu Pro Arg SerSer Glu Arg Arg Gly Ser Gly Gly Gly Thr 65 70 75 80 Gln Phe Pro Ala ArgSer Arg Ala Val Ala Ala Gly Glu Ala Ala Ala 85 90 95 Arg Gly Ala Ala GlyPro Glu Arg Gly Ser Pro Leu Gly Arg Arg Val 100 105 110 Ser Pro Arg CysLeu Cys Ser Gly Glu Gly Gly Gln Val Ala Val Gly 115 120 125 Val Ile AlaGly Lys Arg Gly Arg Arg Gly Arg Asp Gly Ser Arg Arg 130 135 140 Ala ProGly Gly Arg Glu Met Pro Leu Leu His Arg Lys Pro Phe Val 145 150 155 160Arg Gln Lys Pro Pro Ala Asp Leu Arg Pro Asp Glu Glu Val Phe Tyr 165 170175 Cys Lys Val Thr Asn Glu Ile Phe Arg His Tyr Asp Asp Phe Phe Glu 180185 190 Arg Thr Ile Leu Cys Asn Ser Leu Val Trp Ser Cys Ala Val Thr Gly195 200 205 Arg Pro Gly Leu Thr Tyr Gln Glu Ala Leu Glu Ser Glu Lys LysAla 210 215 220 Arg Gln Asn Leu Gln Ser Phe Pro Glu Pro Leu Ile Ile ProVal Leu 225 230 235 240 Tyr Leu Thr Ser Leu Thr His Arg Ser Arg Leu HisGlu Ile Cys Asp 245 250 255 Asp Ile Phe Ala Tyr Val Lys Asp Arg Tyr PheVal Glu Glu Thr Val 260 265 270 Glu Val Ile Arg Asn Asn Gly Ala Arg LeuGln Cys Thr Ile Leu Glu 275 280 285 Val Leu Pro Pro Ser His Gln Asn GlyPhe Ala Asn Gly His Val Asn 290 295 300 Ser Val Asp Gly Glu Thr Ile IleIle Ser Asp Ser Asp Asp Ser Glu 305 310 315 320 Thr Gln Ser Cys Ser PheGln Asn Gly Lys Lys Lys Asp Ala Ile Asp 325 330 335 Pro Leu Leu Phe LysTyr Lys Val Gln Pro Thr Lys Lys Glu Leu His 340 345 350 Glu Ser Ala IleVal Lys Ala Thr Gln Ile Ser Arg Arg Lys His Leu 355 360 365 Phe Ser ArgAsp Lys Leu Lys Leu Phe Leu Lys Gln His Cys Glu Pro 370 375 380 Gln GluGly Val Ile Lys Ile Lys Ala Ser Ser Leu Ser Thr Tyr Lys 385 390 395 400Ile Ala Glu Gln Asp Phe Ser Tyr Phe Phe Pro Asp Asp Pro Pro Thr 405 410415 Phe Ile Phe Ser Pro Ala Asn Arg Arg Arg Gly Arg Pro Pro Lys Arg 420425 430 Ile His Ile Ser Gln Glu Asp Asn Val Ala Asn Lys Gln Thr Leu Ala435 440 445 Ser Tyr Arg Ser Lys Ala Thr Lys Glu Arg Asp Lys Leu Leu LysGln 450 455 460 Glu Glu Met Lys Ser Leu Ala Phe Glu Lys Ala Lys Leu LysArg Glu 465 470 475 480 Lys Ala Asp Ala Leu Glu Ala Lys Lys Lys Glu LysGlu Asp Lys Glu 485 490 495 Lys Lys Arg Glu Glu Leu Lys Lys Ile Val GluGlu Glu Arg Leu Lys 500 505 510 Lys Lys Glu Glu Lys Glu Arg Leu Lys ValGlu Arg Glu Lys Glu Arg 515 520 525 Glu Lys Leu Arg Glu Glu Lys Arg LysTyr Val Glu Tyr Lys Gln Trp 530 535 540 Ser Lys Pro Arg Glu Asp Met GluCys Asp Asp Leu Lys Glu Leu Pro 545 550 555 560 Glu Pro Thr Pro Val LysThr Arg Leu Pro Pro Glu Ile Phe Gly Asp 565 570 575 Ala Leu Met Val LeuGlu Phe Leu Asn Ala Phe Gly Glu Leu Phe Asp 580 585 590 Leu Gln Asp GluPhe Pro Asp Gly Val Thr Leu Glu Val Leu Glu Glu 595 600 605 Ala Leu ValGly Asn Asp Ser Glu Gly Pro Leu Cys Glu Leu Leu Phe 610 615 620 Phe PheLeu Thr Ala Ile Phe Gln Ala Ile Ala Glu Glu Glu Glu Glu 625 630 635 640Val Ala Lys Glu Gln Leu Thr Asp Ala Asp Thr Lys Gly Cys Ser Leu 645 650655 Lys Ser Leu Asp Leu Asp Ser Cys Thr Leu Ser Glu Ile Leu Arg Leu 660665 670 His Ile Leu Ala Ser Gly Ala Asp Val Thr Ser Ala Asn Ala Lys Tyr675 680 685 Arg Tyr Gln Lys Arg Gly Gly Phe Asp Ala Thr Asp Asp Ala CysMet 690 695 700 Glu Leu Arg Leu Ser Asn Pro Ser Leu Val Lys Lys Leu SerSer Thr 705 710 715 720 Ser Val Tyr Asp Leu Thr Pro Gly Glu Lys Met LysIle Leu His Ala 725 730 735 Leu Cys Gly Lys Leu Leu Thr Leu Val Ser ThrArg Asp Phe Ile Glu 740 745 750 Asp Tyr Val Asp Ile Leu Arg Gln Ala LysGln Glu Phe Arg Glu Leu 755 760 765 Lys Ala Glu Gln His Arg Lys Glu ArgGlu Glu Ala Ala Ala Arg Ile 770 775 780 Arg Lys Arg Lys Glu Glu Lys LeuLys Glu Gln Glu Gln Lys Met Lys 785 790 795 800 Glu Lys Gln Glu Lys LeuLys Glu Asp Glu Gln Arg Asn Ser Thr Ala 805 810 815 Asp Ile Ser Ile GlyGlu Glu Glu Arg Glu Asp Phe Asp Thr Ser Ile 820 825 830 Glu Ser Lys AspThr Glu Gln Lys Glu Leu Asp Gln Asp Met Phe Thr 835 840 845 Glu Asp GluAsp Asp Pro Gly Ser His Lys Arg Gly Arg Arg Gly Lys 850 855 860 Arg GlyGln Asn Gly Phe Lys Glu Phe Thr Arg Gln Glu Gln Ile Asn 865 870 875 880Cys Val Thr Arg Glu Leu Leu Thr Ala Asp Glu Glu Glu Ala Leu Lys 885 890895 Gln Glu His Gln Arg Lys Glu Lys Glu Leu Leu Glu Lys Leu Gln Ser 900905 910 Ala Ile Ala Cys Thr Asn Ile Phe Pro Leu Gly Arg Asp Arg Met Tyr915 920 925 Arg Arg Tyr Trp Ile Phe Pro Ser Leu Pro Gly Leu Phe Ile GluGlu 930 935 940 Asp Tyr Ser Gly Leu Thr Glu Asp His Leu Leu Pro Arg ProSer Ser 945 950 955 960 Phe Gln Asn Asn Val Gln Ser Gln Asp Pro Gln ValSer Thr Lys Thr 965 970 975 Gly Glu Pro Leu Met Ser Glu Ser Thr Ser AsnIle Asp Gln Gly Pro 980 985 990 Arg Asp His Ser Val Gln Leu Pro Lys ProVal His Lys Pro Asn Arg 995 1000 1005 Trp Cys Phe Tyr Ser Ser Cys GluGln Leu Asp Gln Leu Ile Glu Ala 1010 1015 1020 Leu Asn Ser Arg Gly HisArg Glu Ser Ala Leu Lys Glu Thr Leu Leu 1025 1030 1035 1040 Gln Glu LysSer Arg Ile Cys Ala Gln Leu Ala Arg Phe Ser Glu Glu 1045 1050 1055 LysPhe His Phe Ser Asp Lys Arg Gln Pro Asp Ser Lys Pro Thr Tyr 1060 10651070 Ser Arg Gly Arg Ser Ser Asn Ala Tyr Asp Pro Ser Gln Met Cys Ala1075 1080 1085 Glu Lys Gln Leu Glu Leu Arg Leu Arg Asp Phe Leu Leu AspIle Glu 1090 1095 1100 Asp Arg Ile Tyr Gln Gly Thr Leu Gly Ala Ile LysVal Thr Asp Arg 1105 1110 1115 1120 His Ile Trp Arg Ser Ala Leu Glu SerGly Arg Tyr Glu Leu Leu Ser 1125 1130 1135 Glu Glu Asn Lys Glu Asn GlyIle Ile Lys Thr Val Asn Glu Asp Val 1140 1145 1150 Glu Glu Met Glu IleAsp Glu Gln Thr Lys Val Ile Val Lys Asp Arg 1155 1160 1165 Leu Leu GlyIle Lys Thr Glu Thr Pro Ser Thr Val Ser Thr Asn Ala 1170 1175 1180 SerThr Pro Gln Ser Val Ser Ser Val Val His Tyr Leu Ala Met Ala 1185 11901195 1200 Leu Phe Gln Ile Glu Gln Gly Leu Glu Arg Arg Phe Leu Lys AlaPro 1205 1210 1215 Leu Asp Ala Ser Asp Ser Gly Arg Ser Tyr Lys Thr ValLeu Asp Arg 1220 1225 1230 Trp Arg Glu Ser Leu Leu Ser Ser Ala Ser LeuSer Gln Val Phe Leu 1235 1240 1245 His Leu Ser Thr Leu Asp Arg Ser ValIle Trp Ser Lys Ser Ile Leu 1250 1255 1260 Asn Ala Arg Cys Lys Ile CysArg Lys Lys Gly Asp Ala Glu Asn Met 1265 1270 1275 1280 Val Leu Cys AspGly Cys Asp Arg Gly His His Thr Tyr Cys Val Arg 1285 1290 1295 Pro LysLeu Lys Ile Val Pro Glu Gly Asp Trp Phe Cys Pro Glu Cys 1300 1305 1310Arg Pro Lys Gln Arg Cys Arg Arg Leu Ser Phe Arg Gln Arg Pro Ser 13151320 1325 Leu Glu Ser Asp Glu Asp Val Glu Asp Ser Met Gly Gly Glu AspAsp 1330 1335 1340 Glu Val Asp Gly Asp Glu Glu Glu Gly Gln Ser Glu GluGlu Glu Tyr 1345 1350 1355 1360 Glu Val Glu Gln Asp Glu Asp Asp Ser GlnGlu Glu Glu Glu Val Ser 1365 1370 1375 Leu Pro Lys Arg Gly Arg Pro GlnVal Arg Leu Pro Val Lys Thr Arg 1380 1385 1390 Gly Lys Leu Ser Ser SerPhe Ser Ser Arg Gly Gln Gln Gln Glu Pro 1395 1400 1405 Gly Arg Tyr ProSer Arg Ser Gln Gln Ser Thr Pro Lys Thr Thr Val 1410 1415 1420 Ser SerLys Thr Gly Arg Ser Leu Arg Lys Ile Asn Ser Ala Pro Pro 1425 1430 14351440 Thr Glu Thr Lys Ser Leu Arg Ile Ala Ser Arg Ser Thr Arg His Ser1445 1450 1455 His Gly Pro Leu Gln Ala Asp Val Phe Val Glu Leu Leu SerPro Arg 1460 1465 1470 Arg Lys Arg Arg Gly Arg Lys Ser Ala Asn Asn ThrPro Glu Asn Ser 1475 1480 1485 Pro Asn Phe Pro Asn Phe Arg Val Ile AlaThr Lys Ser Ser Glu Gln 1490 1495 1500 Ser Arg Ser Val Asn Ile Ala SerLys Leu Ser Leu Gln Glu Ser Glu 1505 1510 1515 1520 Ser Lys Arg Arg CysArg Lys Arg Gln Ser Pro Glu Pro Ser Pro Val 1525 1530 1535 Thr Leu GlyArg Arg Ser Ser Gly Arg Gln Gly Gly Val His Glu Leu 1540 1545 1550 SerAla Phe Glu Gln Leu Val Val Glu Leu Val Arg His Asp Asp Ser 1555 15601565 Trp Pro Phe Leu Lys Leu Val Ser Lys Ile Gln Val Pro Asp Tyr Tyr1570 1575 1580 Asp Ile Ile Lys Lys Pro Ile Ala Leu Asn Ile Ile Arg GluLys Val 1585 1590 1595 1600 Asn Lys Cys Glu Tyr Lys Leu Ala Ser Glu PheIle Asp Asp Ile Glu 1605 1610 1615 Leu Met Phe Ser Asn Cys Phe Glu TyrAsn Pro Arg Asn Thr Ser Glu 1620 1625 1630 Ala Lys Ala Gly Thr Arg LeuGln Ala Phe Phe His Ile Gln Ala Gln 1635 1640 1645 Lys Leu Gly Leu HisVal Thr Pro Ser Asn Val Asp Gln Val Ser Thr 1650 1655 1660 Pro Pro AlaAla Lys Lys Ser Arg Ile 1665 1670 <210> SEQ ID NO 70 <211> LENGTH: 1876<212> TYPE: PRT <213> ORGANISM: Homo sapiens <400> SEQUENCE: 70 Met GluMet Glu Ala Asn Glu Ala Asn Asp His Phe Asn Phe Thr Gly 1 5 10 15 LeuPro Pro Ala Pro Ala Ala Ser Gly Leu Lys Pro Ser Pro Ser Ser 20 25 30 GlyGlu Gly Leu Tyr Thr Asn Gly Ser Pro His Asn Phe Pro Gln Gln 35 40 45 GlyLys Ser Leu Asn Gly Asp Val Asn Val Asn Gly Leu Ser Thr Val 50 55 60 SerHis Thr Thr Thr Ser Gly Ile Leu Asn Ser Ala Pro His Ser Ser 65 70 75 80Ser Thr Ser His Leu His His Pro Ser Val Ala Tyr Asp Cys Leu Trp 85 90 95Asn Tyr Ser Gln Tyr Pro Ser Ala Asn Pro Gly Ser Asn Leu Lys Asp 100 105110 Pro Pro Leu Leu Ser Gln Phe Ser Gly Gly Gln Tyr Pro Leu Asn Gly 115120 125 Ile Leu Gly Gly Ser Arg Gln Pro Ser Ser Pro Ser His Asn Thr Asn130 135 140 Leu Arg Ala Gly Ser Gln Lys Phe Trp Ala Asn Gly Thr His SerPro 145 150 155 160 Met Gly Leu Asn Phe Asp Ser Gln Glu Leu Tyr Asp SerPhe Pro Asp 165 170 175 Gln Asn Phe Glu Glu Val Cys Ser Gly Ile His ProAsp Glu Ala Ala 180 185 190 Glu Lys Glu Met Thr Ser Val Val Ala Glu AsnGly Thr Gly Leu Val 195 200 205 Cys Ser Leu Glu Leu Glu Glu Glu Gln ProGlu Leu Lys Met Cys Gly 210 215 220 Tyr Asn Gly Ser Val Pro Ser Val GluSer Leu His Gln Glu Val Ser 225 230 235 240 Val Leu Val Pro Asp Pro ThrVal Ser Cys Leu Asp Asp Pro Ser His 245 250 255 Leu Pro Asp Gln Leu GluAsp Thr Pro Ile Leu Ser Glu Asp Ser Leu 260 265 270 Glu Pro Phe Asn SerLeu Ala Pro Glu Pro Val Ser Gly Gly Leu Tyr 275 280 285 Gly Ile Asp AspThr Glu Leu Met Gly Ala Glu Asp Lys Leu Pro Leu 290 295 300 Glu Asp SerPro Val Ile Ser Ala Leu Asp Cys Pro Ser Leu Asn Asn 305 310 315 320 AlaThr Ala Phe Ser Leu Leu Ala Asp Asp Ser Gln Thr Ser Thr Ser 325 330 335Ile Phe Ala Ser Pro Thr Ser Pro Pro Val Leu Gly Glu Ser Val Leu 340 345350 Gln Asp Asn Ser Phe Asp Leu Asn Asn Gly Ser Asp Ala Glu Gln Glu 355360 365 Glu Met Glu Thr Gln Ser Ser Asp Phe Pro Pro Ser Leu Thr Gln Pro370 375 380 Ala Pro Asp Gln Ser Ser Thr Ile Gln Leu His Pro Ala Thr SerPro 385 390 395 400 Ala Val Ser Pro Thr Thr Ser Pro Ala Val Ser Leu ValVal Ser Pro 405 410 415 Ala Ala Ser Pro Glu Ile Ser Pro Glu Val Cys ProAla Ala Ser Thr 420 425 430 Val Val Ser Pro Ala Val Phe Ser Val Val SerPro Ala Ser Ser Ala 435 440 445 Val Leu Pro Ala Val Ser Leu Glu Val ProLeu Thr Ala Ser Val Thr 450 455 460 Ser Pro Lys Ala Ser Pro Val Thr SerPro Ala Ala Ala Phe Pro Thr 465 470 475 480 Ala Ser Pro Ala Asn Lys AspVal Ser Ser Phe Leu Glu Thr Thr Ala 485 490 495 Asp Val Glu Glu Ile ThrGly Glu Gly Leu Thr Ala Ser Gly Ser Gly 500 505 510 Asp Val Met Arg ArgArg Ile Ala Thr Pro Glu Glu Val Arg Leu Pro 515 520 525 Leu Gln His GlyTrp Arg Arg Glu Val Arg Ile Lys Lys Gly Ser His 530 535 540 Arg Trp GlnGly Glu Thr Trp Tyr Tyr Gly Pro Cys Gly Lys Arg Met 545 550 555 560 LysGln Phe Pro Glu Val Ile Lys Tyr Leu Ser Arg Asn Leu Val His 565 570 575Ser Val Arg Arg Glu His Phe Ser Phe Ser Pro Arg Met Pro Val Gly 580 585590 Asp Phe Phe Glu Glu Arg Asp Thr Pro Glu Gly Leu Gln Trp Val Gln 595600 605 Leu Ser Ala Glu Glu Ile Pro Ser Arg Ile Gln Ala Ile Thr Gly Lys610 615 620 Arg Gly Arg Pro Arg Asn Thr Glu Lys Ala Lys Thr Lys Glu ValPro 625 630 635 640 Lys Val Lys Arg Gly Arg Gly Arg Pro Pro Lys Val LysIle Thr Glu 645 650 655 Leu Leu Asn Lys Thr Asp Asn Arg Pro Leu Lys LysLeu Glu Ala Gln 660 665 670 Glu Thr Leu Asn Glu Glu Asp Lys Ala Lys IleAla Lys Ser Lys Lys 675 680 685 Lys Met Arg Gln Lys Val Gln Arg Gly GluCys Leu Thr Thr Ile Gln 690 695 700 Gly Gln Ala Arg Asn Lys Phe Lys GlnGlu Thr Lys Ser Leu Lys His 705 710 715 720 Lys Glu Ala Lys Lys Lys SerLys Ala Glu Lys Glu Arg Gly Lys Thr 725 730 735 Lys Gln Glu Lys Leu LysGlu Lys Val Lys Arg Glu Lys Lys Glu Lys 740 745 750 Val Lys Lys Glu LysGlu Glu Val Thr Lys Ala Lys Pro Ala Cys Lys 755 760 765 Ala Asp Lys ThrLeu Ala Thr Gln Arg Arg Leu Glu Glu Arg Gln Lys 770 775 780 Gln Gln MetIle Leu Glu Glu Met Lys Lys Pro Thr Glu Asp Met Cys 785 790 795 800 LeuThr Asp His Gln Pro Leu Pro Asp Phe Ser Arg Val Pro Gly Leu 805 810 815Thr Leu Pro Ser Gly Ala Phe Ser Asp Cys Leu Thr Ile Val Glu Phe 820 825830 Leu His Ser Pro Gly Lys Val Leu Gly Phe Asp Pro Ala Lys Asp Val 835840 845 Pro Ser Leu Gly Val Leu Gln Glu Gly Leu Leu Cys Gln Gly Asp Ser850 855 860 Leu Gly Glu Val Gln Asp Leu Leu Val Arg Leu Leu Lys Ala AlaLeu 865 870 875 880 His Asp Pro Gly Phe Pro Ser Tyr Cys Gln Ser Lys LysIle Leu Gly 885 890 895 Glu Lys Val Ser Glu Ile Pro Leu Thr Arg Asp AsnVal Ser Glu Ile 900 905 910 Leu Arg Cys Phe Leu Met Ala Tyr Gly Val GluPro Ala Leu Cys Asp 915 920 925 Arg Leu Arg Thr Gln Pro Phe Gln Ala GlnPro Pro Gln Gln Lys Ala 930 935 940 Ala Val Leu Ala Phe Pro Val His GluLeu Asn Gly Ser Thr Leu Ile 945 950 955 960 Ile Asn Glu Ile Asp Lys ThrLeu Glu Ser Met Ser Ser Tyr Arg Lys 965 970 975 Asn Lys Trp Ile Val GluGly Arg Leu Arg Arg Leu Lys Thr Val Leu 980 985 990 Ala Lys Arg Thr GlyArg Ser Glu Val Glu Met Gly Arg Pro Glu Glu 995 1000 1005 Cys Leu GlyArg Arg Arg Ser Ser Arg Ile Met Glu Glu Thr Ser Gly 1010 1015 1020 MetGlu Glu Glu Glu Glu Glu Glu Ser Ile Ala Ala Val Pro Gly Arg 1025 10301035 1040 Arg Gly Arg Arg Asp Gly Glu Val Asp Ala Thr Ala Ser Ser IlePro 1045 1050 1055 Glu Leu Glu Arg Gln Ile Glu Lys Leu Ser Lys Arg GlnLeu Phe Phe 1060 1065 1070 Arg Lys Lys Leu Leu His Ser Ser Gln Met LeuArg Ala Val Ser Leu 1075 1080 1085 Gly Gln Asp Arg Tyr Arg Arg Arg TyrTrp Val Leu Pro Tyr Leu Ala 1090 1095 1100 Gly Ile Phe Val Glu Gly ThrGlu Gly Asn Leu Val Pro Glu Glu Val 1105 1110 1115 1120 Ile Lys Lys GluThr Asp Ser Leu Lys Val Ala Ala His Ala Ser Leu 1125 1130 1135 Asn ProAla Leu Phe Ser Met Lys Met Glu Leu Ala Gly Ser Asn Thr 1140 1145 1150Thr Ala Ser Ser Pro Ala Arg Ala Arg Ser Arg Pro Leu Lys Thr Lys 11551160 1165 Pro Gly Phe Met Gln Pro Arg Glu Phe Lys Ser Pro Val Arg GlyGln 1170 1175 1180 Asp Ser Glu Gln Pro Gln Ala Gln Leu Gln Pro Glu AlaGln Leu His 1185 1190 1195 1200 Val Pro Ala Gln Pro Gln Pro Gln Leu GlnLeu Gln Leu Gln Ser His 1205 1210 1215 Lys Gly Phe Leu Glu Gln Glu GlySer Pro Leu Ser Leu Gly Gln Ser 1220 1225 1230 Gln His Asp Leu Ser GlnSer Ala Phe Leu Ser Trp Leu Ser Gln Thr 1235 1240 1245 Gln His Ser SerLeu Leu Ser Ser Ser Val Leu Thr Pro Asp Ser Ser 1250 1255 1260 Pro GlyLys Leu Asp Pro Ala Pro Ser Gln Pro Pro Glu Glu Pro Glu 1265 1270 12751280 Pro Asp Glu Ala Glu Ser Ser Pro Asp Leu Gln Ala Phe Trp Phe Asn1285 1290 1295 Ile Ser Ala Gln Met Pro Cys Asn Ala Ala Pro Thr Pro ProLeu Ala 1300 1305 1310 Val Ser Glu Asp Gln Pro Thr Pro Ser Pro Gln GlnLeu Ala Ser Ser 1315 1320 1325 Lys Pro Met Asn Arg Pro Ser Ala Ala AsnPro Cys Ser Pro Val Gln 1330 1335 1340 Phe Ser Ser Thr Pro Leu Ala GlyLeu Ala Pro Lys Arg Arg Ala Gly 1345 1350 1355 1360 Asp Pro Gly Glu MetPro Gln Ser Pro Thr Gly Leu Gly Gln Pro Lys 1365 1370 1375 Arg Arg GlyArg Pro Pro Ser Lys Phe Phe Lys Gln Met Glu Gln Arg 1380 1385 1390 ValLeu Thr Gln Leu Thr Ala Gln Pro Val Pro Pro Glu Met Cys Ser 1395 14001405 Gly Trp Trp Trp Ile Pro Asp Pro Glu Met Leu Asp Ala Met Leu Lys1410 1415 1420 Ala Leu His Pro Arg Gly Ile Arg Glu Lys Ala Leu His LysHis Leu 1425 1430 1435 1440 Asn Lys His Arg Asp Phe Leu Gln Glu Val CysLeu Arg Pro Ser Ala 1445 1450 1455 Asp Pro Ile Pro Glu Pro Arg Gln LeuPro Ala Phe Gln Glu Gly Ile 1460 1465 1470 Met Ser Trp Ser Pro Lys GluLys Thr Tyr Glu Thr Asp Leu Ala Val 1475 1480 1485 Leu Gln Trp Val GluGlu Leu Glu Gln Arg Val Ile Met Ser Asp Leu 1490 1495 1500 Gln Ile ArgGly Trp Thr Cys Pro Ser Pro Asp Ser Thr Arg Glu Asp 1505 1510 1515 1520Leu Ala Tyr Cys Glu His Leu Ser Asp Ser Gln Glu Asp Ile Thr Trp 15251530 1535 Arg Gly Pro Gly Arg Glu Gly Leu Ala Pro Gln Arg Lys Thr ThrAsn 1540 1545 1550 Pro Leu Asp Leu Ala Val Met Arg Leu Ala Ala Leu GluGln Asn Val 1555 1560 1565 Lys Arg Arg Tyr Leu Arg Glu Pro Leu Trp ProThr His Glu Trp Val 1570 1575 1580 Leu Glu Lys Ala Leu Leu Ser Thr ProAsn Gly Ala Pro Glu Gly Thr 1585 1590 1595 1600 Thr Thr Glu Ile Ser TyrGlu Ile Thr Pro Arg Ile Arg Ile Trp Arg 1605 1610 1615 Gln Thr Leu GlnArg Cys Arg Ser Ala Ala His Val Cys Leu Cys Leu 1620 1625 1630 Gly HisLeu Glu Arg Ser Ile Ala Trp Glu Lys Ser Val Asn Lys Val 1635 1640 1645Thr Cys Leu Val Cys Arg Lys Gly Asp Asn Asp Glu Phe Leu Leu Leu 16501655 1660 Cys Asp Gly Cys Asp Arg Gly Cys His Ile Tyr Cys His Arg ProLys 1665 1670 1675 1680 Met Glu Ala Val Pro Glu Gly Asp Trp Phe Cys ThrVal Cys Leu Ala 1685 1690 1695 Gln Gln Val Glu Gly Glu Phe Thr Gln LysPro Gly Phe Pro Lys Arg 1700 1705 1710 Gly Gln Lys Arg Lys Ser Gly TyrSer Leu Asn Phe Ser Glu Gly Asp 1715 1720 1725 Gly Arg Arg Arg Arg ValLeu Leu Lys Gly Arg Glu Ser Pro Ala Ala 1730 1735 1740 Gly Pro Arg TyrSer Glu Glu Arg Leu Ser Pro Ser Lys Arg Arg Pro 1745 1750 1755 1760 LeuSer Met Arg Asn His His Ser Asp Leu Thr Phe Cys Glu Ile Ile 1765 17701775 Leu Met Glu Met Glu Ser His Asp Ala Ala Trp Pro Phe Leu Glu Pro1780 1785 1790 Val Asn Pro Arg Leu Val Ser Gly Tyr Arg Arg Ile Ile LysAsn Pro 1795 1800 1805 Met Asp Phe Ser Thr Met Arg Glu Arg Leu Leu ArgGly Gly Tyr Thr 1810 1815 1820 Ser Ser Glu Glu Phe Ala Ala Asp Ala LeuLeu Val Phe Asp Asn Cys 1825 1830 1835 1840 Gln Thr Phe Asn Glu Asp AspSer Glu Val Gly Lys Ala Gly His Ile 1845 1850 1855 Met Arg Arg Phe PheGlu Ser Arg Trp Glu Glu Phe Tyr Gln Gly Lys 1860 1865 1870 Gln Ala AsnLeu 1875 <210> SEQ ID NO 71 <211> LENGTH: 1969 <212> TYPE: PRT <213>ORGANISM: Homo sapiens <400> SEQUENCE: 71 Met Gly Gln Thr Lys Ser ThrSer Ser Gly Gly Gly Asn Arg Lys Cys 1 5 10 15 Asn Gln Glu Gln Ser LysAsn Gln Pro Leu Asp Ala Arg Val Asp Lys 20 25 30 Ile Lys Asp Lys Lys ProArg Lys Lys Ala Met Glu Ser Ser Ser Asn 35 40 45 Ser Asp Ser Asp Ser GlyThr Ser Ser Asp Thr Ser Ser Glu Gly Ile 50 55 60 Ser Ser Ser Asp Ser AspAsp Leu Glu Glu Asp Glu Glu Glu Glu Asp 65 70 75 80 Gln Ser Ile Glu GluSer Glu Asp Asp Asp Ser Asp Ser Glu Ser Glu 85 90 95 Ala Gln His Lys SerAsn Asn Gln Val Leu Leu His Gly Ile Ser Asp 100 105 110 Pro Lys Ala AspGly Gln Lys Ala Thr Glu Lys Ala Gln Glu Lys Arg 115 120 125 Ile His GlnPro Leu Pro Leu Ala Phe Glu Ser Gln Thr His Ser Phe 130 135 140 Gln SerGln Gln Lys Gln Pro Gln Val Leu Ser Gln Gln Leu Pro Phe 145 150 155 160Ile Phe Gln Ser Ser Gln Ala Lys Glu Glu Ser Val Asn Lys His Thr 165 170175 Ser Val Ile Gln Ser Thr Gly Leu Val Ser Asn Val Lys Pro Leu Ser 180185 190 Leu Val Asn Gln Ala Lys Lys Glu Thr Tyr Met Lys Leu Ile Val Pro195 200 205 Ser Pro Asp Val Leu Lys Ala Gly Asn Lys Asn Thr Ser Glu GluSer 210 215 220 Ser Leu Leu Thr Ser Glu Leu Arg Ser Lys Arg Glu Gln TyrLys Gln 225 230 235 240 Ala Phe Pro Ser Gln Leu Lys Lys Gln Glu Ser SerLys Ser Leu Lys 245 250 255 Lys Val Ile Ala Ala Leu Ser Asn Pro Lys AlaThr Ser Ser Ser Pro 260 265 270 Ala His Pro Lys Gln Thr Leu Glu Asn AsnHis Pro Asn Pro Phe Leu 275 280 285 Thr Asn Ala Leu Leu Gly Asn His GlnPro Asn Gly Val Ile Gln Ser 290 295 300 Val Ile Gln Glu Ala Pro Leu AlaLeu Thr Thr Lys Thr Lys Met Gln 305 310 315 320 Ser Lys Ile Asn Glu AsnIle Ala Ala Ala Ser Ser Thr Pro Phe Ser 325 330 335 Ser Pro Val Asn LeuSer Thr Ser Gly Arg Arg Thr Pro Gly Asn Gln 340 345 350 Thr Pro Val MetPro Ser Ala Ser Pro Ile Leu His Ser Gln Gly Lys 355 360 365 Glu Lys AlaVal Ser Asn Asn Val Asn Pro Val Lys Thr Gln His His 370 375 380 Ser HisPro Ala Lys Ser Leu Val Glu Gln Phe Arg Gly Thr Asp Ser 385 390 395 400Asp Ile Pro Ser Ser Lys Asp Ser Glu Asp Ser Asn Glu Asp Glu Glu 405 410415 Glu Asp Asp Glu Glu Glu Asp Glu Glu Asp Glu Asp Asp Glu Ser Asp 420425 430 Asp Ser Gln Ser Glu Ser Asp Ser Asn Ser Glu Ser Asp Thr Glu Gly435 440 445 Ser Glu Glu Glu Asp Asp Asp Asp Lys Asp Gln Asp Glu Ser AspSer 450 455 460 Asp Thr Glu Gly Glu Lys Thr Ser Met Lys Leu Asn Lys ThrThr Ser 465 470 475 480 Ser Lys Ser Pro Ser Met Ser Leu Thr Gly His SerThr Pro Arg Asn 485 490 495 Leu His Ile Ala Lys Ala Pro Gly Ser Ala ProAla Ala Leu Cys Ser 500 505 510 Glu Ser Gln Ser Pro Ala Phe Leu Gly ThrSer Ser Ser Thr Leu Thr 515 520 525 Ser Ser Pro His Ser Gly Thr Ser LysArg Arg Arg Val Thr Asp Glu 530 535 540 Arg Glu Leu Arg Leu Pro Leu GluTyr Gly Trp Gln Arg Glu Thr Arg 545 550 555 560 Ile Arg Asn Phe Gly GlyArg Leu Gln Gly Glu Val Ala Tyr Tyr Ala 565 570 575 Pro Cys Gly Lys LysLeu Arg Gln Tyr Pro Glu Val Ile Lys Tyr Leu 580 585 590 Ser Arg Asn GlyIle Met Asp Ile Ser Arg Asp Asn Phe Ser Phe Ser 595 600 605 Ala Lys IleArg Val Gly Asp Phe Tyr Glu Ala Arg Asp Gly Pro Gln 610 615 620 Glu MetGln Trp Cys Leu Leu Lys Glu Glu Asp Val Ile Pro Arg Ile 625 630 635 640Arg Ala Met Glu Gly Arg Arg Gly Arg Pro Pro Asn Pro Asp Arg Gln 645 650655 Arg Ala Arg Glu Glu Ser Arg Met Arg Arg Arg Lys Gly Arg Pro Pro 660665 670 Asn Val Gly Asn Ala Glu Phe Leu Asp Asn Ala Asp Ala Lys Leu Leu675 680 685 Arg Lys Leu Gln Ala Gln Glu Ala Arg Gln Ala Ala Gln Ile LysLeu 690 695 700 Leu Arg Lys Leu Gln Lys Gln Glu Gln Ala Arg Val Ala LysGlu Ala 705 710 715 720 Lys Lys Gln Gln Ala Ile Met Ala Ala Glu Glu LysArg Lys Gln Lys 725 730 735 Glu Gln Ile Lys Ile His Lys Gln Gln Glu LysIle Lys Arg Ile Gln 740 745 750 Gln Ile Arg Met Glu Lys Glu Leu Arg AlaGln Gln Ile Leu Glu Ala 755 760 765 Lys Lys Lys Lys Lys Glu Glu Ala AlaAsn Ala Lys Leu Leu Glu Ala 770 775 780 Glu Lys Arg Ile Lys Glu Arg GluMet Arg Arg Gln Gln Ala Val Leu 785 790 795 800 Leu Lys Arg Gln Glu ArgGlu Arg Arg Arg Gln His Met Met Leu Met 805 810 815 Lys Ala Met Glu AlaArg Lys Lys Ala Glu Glu Lys Glu Arg Leu Lys 820 825 830 Gln Glu Lys ArgAsp Glu Lys Arg Leu Asn Lys Glu Arg Lys Leu Glu 835 840 845 Gln Arg ArgLeu Glu Leu Glu Met Ala Lys Glu Leu Lys Lys Pro Asn 850 855 860 Glu AspMet Cys Leu Ala Asp Gln Lys Pro Leu Pro Glu Leu Pro Arg 865 870 875 880Ile Pro Gly Leu Val Leu Ser Gly Ser Thr Phe Ser Asp Cys Leu Met 885 890895 Val Val Gln Phe Leu Arg Asn Phe Gly Lys Val Leu Gly Phe Asp Val 900905 910 Asn Ile Asp Val Pro Asn Leu Ser Val Leu Gln Glu Gly Ile Leu Leu915 920 925 Asn Ile Gly Asp Ser Met Gly Glu Val Gln Asp Leu Leu Val ArgLeu 930 935 940 Leu Ser Ala Ala Val Cys Asp Pro Gly Leu Ile Thr Gly TyrLys Ala 945 950 955 960 Lys Thr Ala Leu Gly Glu His Leu Leu Asn Val GlyVal Asn Arg Asp 965 970 975 Asn Val Ser Glu Ile Leu Gln Ile Phe Met GluAla His Cys Gly Gln 980 985 990 Thr Glu Leu Thr Glu Ser Leu Lys Thr LysAla Phe Gln Ala His Thr 995 1000 1005 Pro Ala Gln Lys Ala Val Leu AlaPhe Leu Ile Asn Glu Leu Ala Cys 1010 1015 1020 Ser Lys Ser Val Val SerGlu Ile Asp Lys Asn Ile Asp Tyr Met Ser 1025 1030 1035 1040 Asn Leu ArgArg Asp Lys Asn Val Val Glu Gly Lys Leu Arg Lys Leu 1045 1050 1055 ArgIle Ile His Ala Lys Lys Thr Gly Lys Arg Asp Thr Ser Gly Gly 1060 10651070 Ile Asp Leu Gly Glu Glu Gln His Pro Leu Gly Thr Pro Thr Pro Gly1075 1080 1085 Arg Lys Arg Arg Arg Lys Gly Gly Asp Ser Asp Tyr Asp AspAsp Asp 1090 1095 1100 Asp Asp Asp Ser Asp Asp Gln Gly Asp Glu Asp AspGlu Asp Glu Glu 1105 1110 1115 1120 Asp Lys Glu Asp Gln Lys Gly Lys LysThr Asp Ile Cys Glu Asp Glu 1125 1130 1135 Asp Glu Gly Asp Gln Ala AlaSer Val Glu Glu Leu Glu Lys Gln Ile 1140 1145 1150 Glu Lys Leu Ser LysGln Gln Ser Gln Tyr Arg Arg Lys Leu Phe Asp 1155 1160 1165 Ala Ser HisSer Leu Arg Ser Val Met Phe Gly Pro Asp Arg Tyr Arg 1170 1175 1180 ArgArg Tyr Trp Ile Leu Pro Arg Cys Gly Gly Ile Phe Val Glu Gly 1185 11901195 1200 Met Glu Ser Gly Glu Gly Leu Glu Glu Ile Ala Lys Glu Arg GluLys 1205 1210 1215 Leu Lys Lys Ala Glu Ser Val Gln Ile Lys Glu Glu MetPhe Glu Thr 1220 1225 1230 Ser Gly Asp Ser Leu Asn Cys Ser Asn Thr AspHis Cys Glu Gln Lys 1235 1240 1245 Glu Asp Leu Lys Glu Lys Asp Asn ThrAsn Leu Phe Leu Gln Lys Pro 1250 1255 1260 Gly Ser Phe Ser Lys Leu SerLys Leu Leu Glu Val Ala Lys Met Pro 1265 1270 1275 1280 Pro Glu Ser GluVal Met Thr Pro Lys Pro Asn Ala Gly Ala Asn Gly 1285 1290 1295 Cys ThrLeu Ser Tyr Gln Asn Ser Gly Lys His Ser Leu Gly Ser Val 1300 1305 1310Gln Ser Thr Ala Thr Gln Ser Asn Val Glu Lys Ala Asp Ser Asn Asn 13151320 1325 Leu Phe Asn Thr Gly Ser Ser Gly Pro Gly Lys Phe Tyr Ser ProLeu 1330 1335 1340 Pro Asn Asp Gln Leu Leu Lys Thr Leu Thr Glu Lys AsnArg Gln Trp 1345 1350 1355 1360 Phe Ser Leu Leu Pro Arg Thr Pro Cys AspAsp Thr Ser Leu Thr His 1365 1370 1375 Ala Asp Met Ser Thr Ala Ser LeuVal Thr Pro Gln Ser Gln Pro Pro 1380 1385 1390 Ser Lys Ser Pro Ser ProThr Pro Ala Pro Leu Gly Ser Ser Ala Gln 1395 1400 1405 Asn Pro Val GlyLeu Asn Pro Phe Ala Leu Ser Pro Leu Gln Val Lys 1410 1415 1420 Gly GlyVal Ser Met Met Gly Leu Gln Phe Cys Gly Trp Pro Thr Gly 1425 1430 14351440 Val Val Thr Ser Asn Ile Pro Phe Thr Leu Ser Val Pro Ser Leu Gly1445 1450 1455 Ser Gly Leu Gly Leu Ser Glu Gly Asn Gly Asn Ser Phe LeuThr Ser 1460 1465 1470 Asn Val Ala Ser Ser Lys Ser Glu Ser Pro Val ProGln Asn Glu Lys 1475 1480 1485 Ala Thr Ser Ala Gln Pro Ala Ala Val GluVal Ala Lys Pro Val Asp 1490 1495 1500 Phe Pro Ser Pro Lys Pro Ile ProGlu Glu Met Gln Phe Gly Trp Trp 1505 1510 1515 1520 Arg Ile Ile Asp ProGlu Asp Leu Lys Ala Leu Leu Lys Val Leu His 1525 1530 1535 Leu Arg GlyIle Arg Glu Lys Ala Leu Gln Lys Gln Ile Gln Lys His 1540 1545 1550 LeuAsp Tyr Ile Thr Gln Ala Cys Leu Lys Asn Lys Asp Val Ala Ile 1555 15601565 Ile Glu Leu Asn Glu Asn Glu Glu Asn Gln Val Thr Arg Asp Ile Val1570 1575 1580 Glu Asn Trp Ser Val Glu Glu Gln Ala Met Glu Met Asp LeuSer Val 1585 1590 1595 1600 Leu Gln Gln Val Glu Asp Leu Glu Arg Arg ValAla Ser Ala Ser Leu 1605 1610 1615 Gln Val Lys Gly Trp Met Cys Pro GluPro Ala Ser Glu Arg Glu Asp 1620 1625 1630 Leu Val Tyr Phe Glu His LysSer Phe Thr Lys Leu Cys Lys Glu His 1635 1640 1645 Asp Gly Glu Phe ThrGly Glu Asp Glu Ser Ser Ala His Ala Leu Glu 1650 1655 1660 Arg Lys SerAsp Asn Pro Leu Asp Ile Ala Val Thr Arg Leu Ala Asp 1665 1670 1675 1680Leu Glu Arg Asn Ile Glu Arg Arg Ile Glu Glu Asp Ile Ala Pro Gly 16851690 1695 Leu Arg Val Trp Arg Arg Ala Leu Ser Glu Ala Arg Ser Ala AlaGln 1700 1705 1710 Val Ala Leu Cys Ile Gln Gln Leu Gln Lys Ser Ile AlaTrp Glu Lys 1715 1720 1725 Ser Ile Met Lys Val Tyr Cys Gln Ile Cys ArgLys Gly Asp Asn Glu 1730 1735 1740 Glu Leu Leu Leu Leu Cys Asp Gly CysAsp Lys Gly Cys His Thr Tyr 1745 1750 1755 1760 Cys His Arg Pro Lys IleThr Thr Ile Pro Asp Gly Asp Trp Phe Cys 1765 1770 1775 Pro Ala Cys IleAla Lys Ala Ser Gly Gln Thr Leu Lys Ile Lys Lys 1780 1785 1790 Leu HisVal Lys Gly Lys Lys Thr Asn Glu Ser Lys Lys Gly Lys Lys 1795 1800 1805Val Thr Leu Thr Gly Asp Thr Glu Asp Glu Asp Ser Ala Ser Thr Ser 18101815 1820 Ser Ser Leu Lys Arg Gly Asn Lys Asp Leu Gln Lys Arg Lys MetGlu 1825 1830 1835 1840 Glu Asn Thr Ser Ile Asn Leu Ser Lys Gln Glu SerPhe Thr Ser Val 1845 1850 1855 Lys Lys Pro Lys Arg Asp Asp Ser Lys AspLeu Ala Leu Cys Ser Met 1860 1865 1870 Ile Leu Thr Glu Met Glu Thr HisGlu Asp Ala Trp Pro Phe Leu Leu 1875 1880 1885 Pro Val Asn Leu Lys LeuVal Pro Gly Tyr Lys Lys Val Ile Lys Lys 1890 1895 1900 Pro Met Asp PheSer Thr Ile Arg Glu Lys Leu Ser Ser Gly Gln Tyr 1905 1910 1915 1920 ProAsn Leu Glu Thr Phe Ala Leu Asp Val Arg Leu Val Phe Asp Asn 1925 19301935 Cys Glu Thr Phe Trp Glu Asp Asp Ser Asp Ile Gly Arg Ala Gly His1940 1945 1950 Asn Met Arg Lys Tyr Phe Glu Lys Lys Trp Thr Asp Thr PheLys Val 1955 1960 1965 Ser <210> SEQ ID NO 72 <211> LENGTH: 12 <212>TYPE: DNA <213> ORGANISM: Homo sapiens <400> SEQUENCE: 72 tacagaccct cc12

What is claimed is:
 1. A substantially pure polypeptide comprising anamino acid sequence at least 60% identical to any one of SEQ ID NOs: 1,13, 21, 27, or 29, wherein the polypeptide regulates transcription of agene and comprises a bromodomain.
 2. The polypeptide of claim 1, whereinthe amino acid sequence is at least 70% identical to any one of SEQ IDNOs: 1, 13, 21, 27, or
 29. 3. The polypeptide of claim 1, wherein theamino acid sequence is at least 80% identical to any one of SEQ ID NOs:1, 13, 21, 27, or
 29. 4. The polypeptide of claim 1, wherein the aminoacid sequence is at least 90% identical to any one of SEQ ID NOs: 1, 13,21, 27 or
 29. 5. A substantially pure polypeptide comprising any one ofSEQ ID NOs: 1, 13, 21, 27, or
 29. 6. A substantially pure polypeptidecomprising the amino acid sequence of any one of SEQ ID NOs: 1, 13, 21,27, or 29, with up to 30 conservative amino acid substitutions, whereinthe polypeptide regulates transcription of a gene and comprises abromodomain.
 7. A substantially pure polypeptide encoded by a nucleicacid that hybridizes under high stringency conditions to a probe thesequence of which consists of any one of SEQ ID NOs: 2, 14, 22, 28, or30, wherein the polypeptide regulates transcription of a gene andcomprises a bromodomain.
 8. An isolated nucleic acid encoding thepolypeptide of claim
 1. 9. An isolated nucleic acid encoding thepolypeptide of claim
 5. 10. An isolated nucleic acid encoding thepolypeptide of claim
 6. 11. An isolated nucleic acid comprising a strandthat hybridizes under high stringency conditions to a single strandedprobe consisting of any one of SEQ ID NOs: 2, 14, 22, 28, or
 30. 12. Theisolated nucleic acid of claim 11, wherein the nucleic acid encodes apolypeptide that regulates transcription of a gene and comprises abromodomain.
 13. The nucleic acid of claim 12, wherein the polypeptidecomprises any one of SEQ ID NOs: 1, 13, 21, 27, or
 29. 14. The nucleicacid of claim 11, wherein the strand is at least 15 nucleotides inlength.
 15. A vector comprising the nucleic acid of claim
 8. 16. Avector comprising the nucleic acid of claim
 9. 17. A vector comprisingthe nucleic acid of claim
 10. 18. A vector comprising the nucleic acidof claim
 11. 19. A vector comprising the nucleic acid of claim
 12. 20. Acultured host cell comprising the nucleic acid of claim
 8. 21. Acultured host cell comprising the nucleic acid of claim
 9. 22. Acultured host cell comprising the nucleic acid of claim
 10. 23. Acultured host cell comprising the nucleic acid of claim
 11. 24. Acultured host cell comprising the nucleic acid of claim
 12. 25. Anantibody that specifically binds to the polypeptide of claim
 1. 26. Amethod of preparing a polypeptide, the method comprising culturing thehost cell of claim 20, wherein the host cell expresses the polypeptide,and isolating the polypeptide from the host cell.
 27. A method ofscreening for a compound that binds to a polypeptide, the methodcomprising providing a polypeptide comprising an amino acid sequence atleast 60% identical to SEQ ID NO: 1; contacting a test compound with thepolypeptide; and determining whether the test compound has bound to thepolypeptide.
 28. A compound which specifically binds to the polypeptideof claim 1.